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(57) Abstract: The present invention relates to methods for identifying compounds that modulate premature translation termination 
and/or nonsense-mediated mRNA decay by screening and identifying compounds that modulate the post-lranscriptional expression 
of any gene with a premature translation stop codon. The invention particularly relates to using any gene encoding a premature stop 
codon to identify compounds that modulate premature translation termination and/or nonsense-mediated mRNA decay. A compound 
that modulates premature translation termination and/or nonsense-mediated mRNA decay of a taiget gene is identified using standard 
methods known in the art to measure changes in translation or mRNA stability of the gene product or mRNA of the gene with the 
premature stop codon. The methods of the present invention provide a simple, sensitive assay for high-throughput screening of 
libraries of compounds to identify phannaceutical leads. 
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METHODS FOR IDENTIFYING SMALL MOLECULES THAT MODULATE 



PREMATU RE TRANSLATION TERMIN A TION AND NONSENSE MEDIATED mRNA 

DECAY 



This application is entitled to and claims priority benefit to U.S. Provisional 
Patent Application No. 60/390,747, filed June 21, 2002, U.S. Provisional Patent Application 
5 No. 60/398,180, filed July 24, 2002 and U.S. Provisional Patent Application No. 

60/398,287, filed July 24, 2002, each of which are incorporated herein by reference in their 
entirety, 

1. INTRODUCTION 

The present invention relates to a method for screening and identifying 
10 compounds that modulate prematiu'e translation termination and/or nonsense-mediated 

messenger ribonucleic acid ("mRNA") decay by screening and identifying compounds that 
modulate the post-transcriptional expression of any gene with a premature translation stop 
codon. A compound that modulates premature translation termination and/or nonsense- 
mediated mRNA decay of a target gene is identified using standard methods known in the 
IS art to measure changes in translation or mRNA stability of the gene product or nlRNA of 
the gene with the premature stop codon. The methods of the present invention provide a 
simple, sensitive assay for high-throughput screening of libraries of compounds to identify 
pharmaceutical leads. 

2. BACKGROUND OF THE INVENTION 

20 Protein synthesis encompasses the processes of translation initiation, 

elongation, and termination, each of which has evolved to occur with great accuracy and has 
the capacity to be a regulated step in the pathway of gene expression. Recent studies, 
including those suggesting that events at termination may regulate the ability of ribosomes 
to recycle to the start site of the same mRNA, have underscored the potential of termination 

25 to regulate other aspects of translation. The RNA triplets UAA, UAG, and UGA are non- 
coding and promote translational termination. Termination starts when one of the three 
termination codons enters the A site of the ribosome signaling the polypeptide chain release 
factors to bind and recognize the termination signal. Subsequently, the ester bond between 

-1 . 



wo 2004/001010 PCT/US2003/019760 

the 3' nucleotide of the transfer RNA ("tRNA") located in the ribosome's P site and the 
nascent polypeptide chain is hydrolyzed, the completed polypeptide chain is released, and 
the ribosome subunits are recycled for another round of translation. 

Nonsense-mediated mRNA decay is a surveillance mechanism that 

5 minimizes the translation and regulates the RNA stability of nonsense RNAs that contain 
chain termination mutations (see, e.g,, Hentze & Kulozik, 1999, Cell 96:307-310; 
Culbertson, 1999, Trends in Genetics 15:74-80; Li & Wilkinson, 1998, Immunity 8:135- 
141; and Ruiz-Echevarria et aL, 1996, Trends in Biological Sciences, 21 :433-438). Chain 
termination mutations are caused by a base substitution or frameshift mutation changes a 

10 codon into a termination codon, /.e., a stop codon that causes translational termination. In 
nonsense-mediated mRNA decay, mRNAs with premature stop codons are subject to 
degradation. In some cases, a truncated protein is produced if the premature stop codon is 
located near the end of an open reading frame. 

Certain classes of known antibiotics have been characterized and found to 

1 5 interact with RNA. For example, the antibiotic thiostreptone binds tightly to a 60-mer from 
ribosomal RNA (Cundlifife et al, 1990, in The Ribosome: Structure, Function & Evolution 
(Schlessinger et aL, eds.) American Society for Microbiology, Washington, D.C. pp. 479- 
490). Bacterial resistance to various antibiotics often involves methylation at specific rRNA 
sites (Cundliffe, 1989, Ann. Rev. Microbiol. 43:207-233). Aminoglycosidic aminocyclitol 

20 (aminoglycoside) antibiotics and peptide antibiotics are known to inhibit group I intron 
splicing by binding to specific regions of the RNA (von Ahsen et aL, 1991, Nature 
(London) 353:368-370). Some of these same aminoglycosides have also been found to 
inhibit hammerhead ribozyme fimction (Stage et al, 1995, RNA 1:95-101). In addition, 
certain aminoglycosides and other protein synthesis inhibitors have been found to interact 

25 with specific bases in 16S rRNA (Woodcock et al. 1991. EMBO J. 10:3099-3103), An 
oligonucleotide analog of the 16S rRNA has also been shown to interact with certain 
aminoglycosides (Purohit et al, 1994, Nature 370:659-662). A molecular basis for 
hypersensitivity to aminoglycosides has been found to be located in a single base change in 
mitochondrial rRNA (Hutchin et al, 1993, Nucleic Acids Res. 21:4174-4179). 

30 Aminoglycosides have also been shown to inhibit the interaction between specific structural 
RNA motifs and the corresponding RNA binding protein. Zapp et al (Cell, 1993, 74:969- 
978) has demonstrated that the aminoglycosides neomycin B, lividomycin A, and 
tobramycin can block the binding of Rev, a viral regulatory protein required for viral gene 
expression, to its viral recognition element in the IIB (or RRE) region of HIV RNA. This 
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blockage appears to be the result of competitive binding of the antibiotics directly to the 
RRE RNA structural motif. 

Aminoglycosides have also been found to promote nonsense suppression 
(see, Bedwell et aL, 1997. Nat. Med. 3:1280-1284 and Howard et aL, 1996, Nat. Med. 
5 2:467-469). Nonsense mutations cause approximately 10 to 30 percent of the individual 
cases of virtually all inherited diseases. Although nonsense mutations inhibit the synthesis 
of a full length protein to one percent or less of wild-type levels, minimally boosting the 
expression levels of the full length protein to between five and fifteen percent of normal 
levels can greatly reduce the severity or eliminate the disease. Clinical approaches that 

10 target the translation termination event to promote nonsense suppression have recently been 
described for model systems of cystic fibrosis and muscular dystrophy. Gentamicin is an 
aminoglycoside antibiotic that causes translational misreading and allowed the insertion of 
amino acids at the site of the nonsense codon in models of cystic fibrosis, Hurlers 
Syndrome, and muscular dystrophy (see, e.g., Barton-Davis et aL^ 1999, J. Clin. Invest. 

15 104:375-381). These results strongly suggest that drugs that promote nonsense suppression 
by altering translation termination efficiency of a premature termination codon can be 
therapeutically valuable in the treatment of diseases caused by nonsense mutations. 

Citation or identification of any reference in Section 2 of this application is 
not an admission that such reference is available as prior art to the present invention. 

20 3. SUMMARY OF THE INVENTION 

The present invention provides methods for identifying a compound that 
modulates premature translation termination and/or nonsense-mediated mRNA decay. In 
particular, the invention provides methods for identifying a compound that suppresses 
premature translation termination and/or nonsense-mediated mRNA decay. The invention 

25 encompasses the use of the compounds identified utilizing the methods of the invention for 
the prevention, treatment, management or amelioration of a disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay, or a symptom thereof. 

The invention provides cell-based and cell-free assays for the identification 

30 of a compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay. In general, the level of expression of a reporter gene product past the 
premature termination codon and/or activity of such a gene product in the reporter gene 
based-assays described herein is indicative of the effect of the compound on premature 
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translation tennination and/or nonsense-mediated mRNA decay. The reporter gene-based 
assays described herein for the identification of compounds that modulate premature 
translation termination and/or nonsense-mediated mRNA decay are well suited for high- 
throughput screening. 

5 The reporter gene cell-based assays may be conducted by contacting a 

compound with a cell containing a nucleic acid sequence comprising a reporter gene, 
wherein the reporter gene comprises a premature stop codon or nonsense mutation, and 
measuring the expression of the reporter gene. The reporter gene cell-based assays may 
also be conducted by: (a) contacting a compound with a cell containing a first nucleic acid 

1 0 sequence and a second nucleic acid sequence, wherein the first nucleic acid sequence 

comprises a regulatory element operably linked to a reporter gene and the second nucleic 
acid sequence comprises a nucleotide sequence with a premature stop codon or nonsense 
mutation that encodes a regulatory protein that binds to the regulatory element of the first 
nucleic acid sequence and regulates the expression of the reporter gene; and (b) measuring 

15 the expression of the reporter gene. 

The reporter gene cell-based assays may also be conducted by: (a) contacting 
a compound with a cell containing a first nucleic acid sequence, a second nucleic acid 
sequence and a third nucleic acid sequence, wherein (i) the first nucleic acid sequence 
comprises a nucleotide sequence encoding a first fiision protein comprising a DNA binding 

20 domain and a first protein, the nucleotide sequence of the DNA binding domain or the first 
protein containing a premature stop codon or nonsense mutation, (ii) the second nucleic acid 
sequence comprises a nucleotide sequence encoding a second fiision protein comprising an 
activation domain and a second protein, the second protein interacting with the first protein 
to produce a regulatory protein, and (iii) the third nucleic acid sequence comprises a 

25 regulatory element operably linked to a reporter gene, the expression of the reporter gene 
being regulated by the binding of the regulatory protein to the regulatory element: and (b) 
measuring the expression of the reporter gene. Further, the reporter gene cell-based assays 
maybe conducted by: (a) contacting a compound with a cell containing a first nucleic acid 
sequence, a second nucleic acid sequence and a third nucleic acid sequence, wherein (i) the 

30 first nucleic acid sequence comprises a nucleotide sequence encoding a first fiision protein 
comprising a DNA binding domain and a first protein, (ii) the second nucleic acid sequence 
comprises a nucleotide sequence encoding a second fiision protein comprising an activation 
domain and a second protein, the nucleotide sequence of the activation domain or the 
second protein containing a premature stop codon or nonsense mutation, and the second 

-4- 
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protein interacting with the first protein to produce a regulatory protein, and (iii) the third 
nucleic acid sequence comprises a regulatory element operably linked to a reporter gene, the 
expression of the reporter gene being regulated by the binding of the regulatory protein to 
the regulatory element: and (b) measuring the expression of the reporter gene. 
5 The reporter gene cell-free assays may be conducted by contacting a 

compound with a cell-free extract and a nucleic acid sequence comprising a reporter gene, 
wherein the reporter gene comprises a premature stop codon or nonsense mutation, and 
measuring the expression of the reporter gene. The reporter gene cell-free assays may also 
be conducted by contacting a compound with a cell-free extract and an in vitro transcribed 

10 RNA of a reproter gene, wherein the RNA product contains a premature stop codon or 
nonsense mutation, and measuring the expression of the protein encoded by the RNA 
product. The reporter gene cell-free assays may also be conducted by: (a) contacting a 
compound with a cell-free extract, a first nucleic acid sequence and a second nucleic acid 
sequence, wherein the first nucleic acid sequence comprises a regulatory element operably 

IS linked to a reporter gene and the second nucleic acid sequence comprises a nucleotide 
sequence with a premature stop codon or nonsense mutation that encodes a regulatory 
protein that binds to the regulatory element of the first nucleic acid sequence and regulates 
the expression of the reporter gene; and (b) measuring the expression of the reporter gene. 

The reporter gene cell-free assays may also be conducted by: (a) contacting a 

20 compound with a cell-free extract, a first nucleic acid sequence, a second nucleic acid 
sequence and a third nucleic acid sequence, wherein (i) the first nucleic acid sequence 
comprises a nucleotide sequence encoding a first fiision protein comprising a DNA binding 
domain and a first protein, the nucleotide sequence of the DNA binding domain or the first 
protein containing a premature stop codon or nonsense mutation, (ii) the second nucleic acid 

25 sequence comprises a nucleotide sequence encoding a second fiision protein comprising an 
activation domain and a second protein, the second protein interacting with the first protein 
to produce a regulatory protein, and (iii) the third nucleic acid sequence comprises a 
regulatory element operably linked to a reporter gene, the expression of the reporter gene 
being regulated by the binding of the regulatory protein to the regulatory element: and (b) 

30 measuring the expression of the reporter gene. The reporter gene cell-free assays may also 
be conducted by: (a) contacting a compound with a cell-free extract, a first nucleic acid 
sequence, a second nucleic acid sequence and a third nucleic acid sequence, wherein (i) the 
first nucleic acid sequence comprises a nucleotide sequence encoding a first fiision protein 
comprising a DNA binding domain and a first protein, (ii) the second nucleic acid sequence 
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comprises a nucleotide sequence encoding a second fusion protein comprising an activation 
domain and a second protein, the nucleotide sequence of the activation domain or the 
second protein containing a premature stop codon or nonsense mutation, and the second 
protein interacting with the first protein to produce a regulatory protein, and (iii) the third 
5 nucleic acid sequence comprises a regulatory element operably linked to a reporter gene, the 
expression of the reporter gene being regulated by the binding of the regulatory protein to 
the regulatory element; and (b) measuring the expression of the reporter gene. 

In the cell-based and cell-free reporter gene assays described herein, the 
alteration in reporter gene expression or activity relative to a previously determined 

10 reference range, or to the expression or activity of the reporter gene in the absence of the 

compound or the presence of an appropriate control (e.g., a negative control) indicates that a 
particular compound modulates premature translation teraiination and/or nonsense-mediated 
mRNA decay. In particular, a decrease in reporter gene expression or activity relative to a 
previously deteraiined reference range, or to the expression in the absence of the compound 

15 or the presence of an appropriate control {e.g., a negative control) may, depending upon the 
parameters of the reporter gene assay, indicate that a particular compound reduces or 
suppresses premature translation termination and/or nonsense-mediated mRNA decay. In 
contrast, an increase in reporter gene expression or activity relative to a previously 
detemiined reference range, or to the expression in the absence of the compound or the 

20 presence of an appropriate control (e.g., a negative control) may, depending upon the 

parameters of the reporter gene-based assay, indicate that a particular compound enhances 
premature translation termination and/or nonsense-mediated mRNA decay. 

The invention relates to the identification of compounds that modulate 
premature translation termination or nonsense-mediated mRNA decay, using, in some 

25 instances, a reporter based assay. The invention provides for the identification of 

compounds that modulated premature translation termination via a nonsense stop codon in a 
nucleic acids. Such nucleic acids include, but are not limited to, DNA and RNA. In a more 
certain embodiment, the nucleic acid is RNA. In another embodiment, the nucleic acid is 
single stranded. In other embodiments, the nucleic acids are single stranded. In yet other 

30 embodiments, the nucleic acids are more than single stranded, e.g., double, triple or 
quadruple stranded. 

In one embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) expressing a nucleic acid sequence comprising a 
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reporter gene in a cell, wherein the reporter gene comprises a premature stop codon; (b) 
contacting said cell with a member of a library of compounds; and (c) detecting the 
expression of said reporter gene, wherein a compound that modulates premature translation 
temiination and/or nonsense-mediated mRNA decay is identified if the expression of said 
reporter gene in the presence of a compound is altered relative to a previously determined 
reference range, or the expression of said reporter gene in the absence of the compound or 
the presence of an appropriate control (e.g,,. a negative control such as phosphate buffered 
saline). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell containing a nucleic acid sequence comprising a reporter gene, wherein the 
reporter gene comprises a premature stop codon; and (b) detecting the expression of said 
reporter gene, wherein a compound that modulates premature translation temiination and/or 
nonsense-mediated mRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 
appropriate control (e.g., a negative control). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation temiination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell-free extract and a nucleic acid sequence comprising a reporter gene, wherein the 
reporter gene comprises a premature stop codon; and (b) detecting the expression of said 
reporter gene, wherein a compound that modulates premature translation termination and/or 
nonsense-mediated mRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 
appropriate control (e.g., a negative control). In accordance with this embodiment, the cell- 
extract is preferably isolated from cells that have been incubated at about O^'C to about lO^^C 
and^or an S 10 to S30 cell-free extract. 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell containing a first nucleic acid sequence and a second nucleic acid sequence. 
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wherein the first nucleic acid sequence comprises a regulatory element operably linked to a 
reporter gene and the second nucleic acid sequence comprises a nucleotide sequence with a 
premature stop codon that encodes a regulatory protein that binds to the regulatory element 
of the first nucleic acid sequence and regulates the expression of the reporter gene; and (b) 

5 detecting the expression of the reporter gene, wherein a compound that modulates 

premature translation termination and/or nonsense-mediated mRNA decay is identified if 
the expression of said reporter gene in the presence of a compound is altered relative to a 
previously determined reference range, or the expression of said reporter gene in the 
absence of said compound or the presence of an appropriate control ie.g., a negative 

10 control). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compotmds 
with a cell containing a first nucleic acid sequence, a second nucleic acid sequence and a 

15 third nucleic acid sequence, wherein (i) the first nucleic acid sequence comprises a 

nucleotide sequence encoding a first fiision protein comprising a DNA binding domain and 
a first protein, the nucleotide sequence of the DNA binding domain or the first protein 
comprising a premature stop codon, (ii) the second nucleic acid sequence comprises a 
nucleotide sequence encoding a second fusion protein comprising an activation domain and 

20 a second protein, the second protein interacting with the fu-st protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory element operably 
linked to a reporter gene, the expression of the reporter gene being regulated by the binding 
of the regulatory protein to the regulatory element; and (b) detecting the expression of the 
reporter gene, wherein a compound that modulates premature translation termination and/or 

25 nonsense-mediated mRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 
appropriate control (e.g., a negative control). 

In another embodiment, the invention provides a method for identifying a 

30 compound that modulates premature translation termination and/or nonsense-mediated 

mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell containing a first nucleic acid sequence, a second nucleic acid sequence and a 
third nucleic acid sequence, wherein (i) the first nucleic acid sequence comprises a 
nucleotide sequence encoding a first fiision protein comprising a DNA binding domain and 
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a first protein, (ii) the second nucleic acid sequence comprises a nucleotide sequence 
encoding a second fusion protein comprising an activation domain and a second protein, the 
nucleotide sequence of the activation domain or the second protein containing a premature 
stop codon, and the second protein interacting with the first protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory element operably 
linked to a reporter gene, the expression of the reporter gene being regulated by the binding 
of the regulatory protein to the regulatory element; and (b) detecting the expression of the 
reporter gene, wherein a compound that modulates premature translation termination and/or 
nonsense-mediated mRN A decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 
appropriate control {e.g,, a negative control). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell-free extract, a first nucleic acid sequence and a second nucleic acid sequence, 
wherein the first nucleic acid sequence comprises a regulatory element operably linked to a 
reporter gene and the second nucleic acid sequence comprises a nucleotide sequence with a 
premature stop codon that encodes a regulatory protein that binds to the regulatory element 
of the first nucleic acid sequence and regulates the expression of the reporter gene; and (b) 
detecting the expression of the reporter gene, wherein a compound that modulates 
premature translation termination and/or nonsense-mediated mRNA decay is identified if 
the expression of said reporter gene in the presence of a compound is altered relative to a 
previously determined reference range, or the expression of said reporter gene in the 
absence of said compound or the presence of an appropriate control (e.^., a negative 
control). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell-fi-ee extract, a first nucleic acid sequence, a second nucleic acid sequence and a 
third nucleic acid sequence, wherein (i) the first nucleic acid sequence comprises a 
nucleotide sequence encoding a first fusion protein comprising a DNA binding domain and 
a first protein, the nucleotide sequence of the DNA binding domain or the first protein 
comprising a premature stop codon, (ii) the second nucleic acid sequence comprises a 
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nucleotide sequence encoding a second fusion protein comprising an activation domain and 
a second protein, the second protein interacting with the first protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory element operably 
linked to a reporter gene, the expression of the reporter gene being regulated by the binding 

5 of the regulatory protein to the regulatory element; and (b) detecting the expression of the 
reporter gene, wherein a compound that modulates premature translation termination and/or 
nonsense-mediated mRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 

10 appropriate control (e.g. , a negative control). 

In another embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell-free extract, a first nucleic acid sequence, a second nucleic acid sequence and a 

15 third nucleic acid sequence, wherein (i) the first nucleic acid sequence comprises a 

nucleotide sequence encoding a first fusion protein comprising a DNA binding domain and 
a first protein, (ii) the second nucleic acid sequence comprises a nucleotide sequence 
encoding a second fusion protein comprising an activation domain and a second protein, the 
nucleotide sequence of the activation domain or the second protein containing a premature 

20 stop codon, and the second protein interacting with the first protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory element operably 
linked to a reporter gene, the expression of the reporter gene being regulated by the binding 
of the regulatory protein to the regulatory element; and (b) detecting the expression of the 
reporter gene, wherein a compound that modulates premature translation termination and/or 

25 nonsense-mediated mRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene in the absence of said compound or the presence of an 
appropriate control (e.g., a negative control). 

In accordance with the invention, the step of contacting a compound with a 

30 cell, or cell-free extract and a nucleic acid sequence in the reporter gene-based assays 

described herein is preferably conducted in an aqueous solution comprising a buffer and a 
combination of salts (such as KCl, NaCl and/or MgCh). The optimal concentration of each 
salt used in the aqueous solution is dependent on, e.g., the protein, polypeptide or peptide 
encoded by the nucleic acid sequence {e.g., the regulatory protein) and the compounds used, 

- 10- 
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and can be determined using routine experimentation. In a specific embodiment, the 
aqueous solution approximates or mimics physiologic conditions. In another specific 
embodiment, the aqueous solution further comprises a detergent or a surfactant. 

The assays of the present invention can be performed using different 
5 incubation times. In the a cell-based system, the cell and a compound or a member of a 
library of compounds may be incubated together for at least 0.2 hours, 0.25 hours, 0.5 
hours, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 hours, 12 hours, 18 
hours, at least 1 day, at least 2 days or at least 3 days before the expression and/or activity of 
a reporter gene is measured. In a cell-free system, the cell-free extract and the nucleic acid 
10 sequence(s) (e.g.^ a reporter gene) can be incubated together before the addition of a 

compound or a member of a library of compounds. In certain embodiments, the cell-free 
extract are incubated with a nucleic acid sequence(s) (e.g.^ a reporter gene) before the 
addition of a compound or a member of a library of compounds for at least 0.2 hours, 0.25 
hours, 0.5 hours, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 hours, 12 
15 hours, 18 hours, or at least 1 day. In other embodiments, the cell-free extract, or the nucleic 
acid sequence(s) (e.g^., a reporter gene) is incubated with a compound or a member of a 
library of compounds before the addition of the nucleic acid sequence(s) (e.g., a reporter 
gene), or the cell-free extract, respectively. In certain embodiments, a compound or a 
member of a library of compounds is incubated with a nucleic acid sequence(s) (e.g., a 
20 reporter gene) or cell-free extract before the addition of the remaining component, i.e., cell- 
free extract, or a nucleic acid sequence(s) (e.g., a reporter gene), respectively, for at least 0.2 
hours, 0.25 hours, 0.5 hours, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 
hours, 12 hours, 18 hours, or at least 1 day. Once the reaction vessel comprises the 
components, i.e., a compound or a member of a library of compounds, the cell-free extract 
25 and the nucleic acid sequence(s) (e.g,, a reporter gene), the reaction may be further 

incubated for at least 0.2 hours, 0.25 hours, 0.5 hours, 1 hour, 2 hours, 3 hours, 4 hours, 5 
hours, 6 hours, 8 hours, 10 hours, 12 hours, 18 hours, or at least 1 day. 

The progress of the reaction in the reporter gene-based assays can be 
measured continuously. Altematively, time-points may be taken at different times of the 
30 reaction to monitor the progress of the reaction in the reporter gene-based assays. 

The reporter gene-based assays described herein may be conducted in a cell 
genetically engineered to express a reporter gene or in vitro utilizing a cell-free extract. 
Any cell or cell line of any species well-known to one of skill in the art may be utilized in 
accordance with the methods of the invention. Further, a cell-free extract may be derived 
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fh>m any cell or cell line of any species well-known to one of skill in the art. Examples of 
cells and cell types include, but are not limited to, human cells, cultured mouse cells, 
cultured rat cells or Chinese hamster ovary ("CHO") cells. 

The reporter gene constructs utilized in the reporter gene-based assays 
5 described herein may comprise the coding region of a reporter gene and a premature stop 
codon that results in premature translation termination and/or nonsense-mediated mRNA 
decay. Preferably, the premature stop codon is N-terminal to the native stop codon of the 
reporter gene and is located such that the suppression of the premature stop codon is readily 
detectable. In a specific embodiment, a reporter gene construct utilized in the reporter gene- 

10 based assays described herein comprises the coding region of a reporter gene containing a 
premature stop codon at least 15 nucleotides, preferably 25 to 50 nucleotides, 50 to 75 
nucleotides or 75 to 100 nucleotides from the start codon in the open reading frame of the 
reporter gene. In another embodiment, a reporter gene constmct utilized in the reporter 
gene-based assays described herein comprises the coding region of a reporter gene 

15 containing a premature stop codon at least 15 nucleotides, preferably 25 to 50 nucleotides, 
50 to 75 nucleotides, 75 to 100 nucleotides, or 100 to 150 nucleotides from the native stop 
codon in the open reading frame of the reporter gene. In another embodiment, a reporter 
gene construct utilized in the reporter gene-based assays described herein comprises the 
coding region of a reporter gene containing a UAG and/or UGA premature stop codon. In 

20 yet another embodiment, a reporter gene construct utilized in the reporter gene based assays 
described herein comprises the coding region of a reporter gene, containing a premature 
stop codon in the context of UGAA, UGAC, UGAG, UGAU, UAGA, UAGC, UAGG. 
UAGU, UAAA, UAAC, UAAG or UAAU.. 

Alternatively, the reporter gene constructs utilized in the reporter gene-based 

25 assays described herein comprise a regulatory element that is responsive to a regulatory 

protein encoded by a nucleic acid sequence containing a premature stop codon. Preferably, 
the premature stop codon in the nucleotide sequence of a regulatory protein or a component 
or subunit thereof is N-terminal to the native stop codon of the regulatory protein or 
component or subunit thereof and the location of the premature stop codon is such that it 

30 alters the biological activity of the regulatory protein (e,g., the abihty of the regulatory 
protein to bind to its regulatory element). In a specific embodiment, the premature stop 
codon in the nucleotide sequence of a regulatory protein or a component or subunit thereof 
is at least 15 nucleotides preferably 25 to 50 nucleotides, 50 to 75 nucleotides or 75 to 100 
nucleotides from the start codon in the open reading frame of the regulatory protein, 
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component or subunit thereof. In another embodiment, the premature stop codon in the 
nucleotide sequence of a regulatory protein or a component or subunit thereof is at least IS 
nucleotides, preferably 25 to 50 nucleotides, 50 to 75 nucleotides, 75 to 100 nucleotides, or 
100 to 150 nucleotides from the native stop codon in the open reading frame of the 
5 regulatory protein, component or subunit thereof. In another embodiment, the premature 
stop codon in the nucleotide sequence of regulatory protein or a component or subunit 
thereof is UAG or UGA. Any reporter gene well-known to one of skill in the art may be 
utilized in the reporter gene constmcts described herein. Examples of reporter genes 
include, but are not limited to, the gene encoding firefly luciferase, the gene coding renilla 
10 luciferase, the gene encoding click beetle luciferase, the gene encoding green fluorescent 
protein, the gene encoding yellow fluorescent protein, the gene encoding red fluorescent 
protein, the gene encoding cyan fluorescent protein, the gene encoding blue fluorescent 
protein, the gene encoding beta-galactosidase, the gene encoding beta-glucoronidase, the 
gene encoding beta-lactamase, the gene encoding chloramphenicol acetyltransferase, and 
1 5 the gene encoding alkaline phosphatase. 

The compounds utilized in the assays described herein may be members of a 
library of compounds. In specific embodiment, the compound is selected from a 
combinatorial library of compounds comprising peptoids; random biooligomers; 
diversomers such as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; 
20 nonpeptidal peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid 
libraries; antibody libraries; carbohydrate libraries; and small organic molecule libraries. In 
a preferred embodiment, the small organic molecule libraries are Ubraries of 
benzodiazepines, isoprenoids, thiazolidinones, metathiazanones, pyrrolidines, morpholino 
compounds, or diazepindiones. 
25 In certain embodiments, the compounds are screened in pools. Once a 

positive pool has been identified, the individual compounds of that pool are tested 
separately. In certain embodiments, the pool size is at least 2, at least 5, at least 10, at least 
25, at least 50, at least 75, at least 100, at least 150, at least 200, at least 250, or at least 500 
compounds. 

30 Once a compound that modulates premature translation termination and/or 

nonsense-mediated mRNA decay is identified, the structure of the compound may be 
detemiined utilizing well*known techniques or by referring to a predetermined code. For 
example, the structure of the compound may be determined by mass spectroscopy, NMR, 
vibrational spectroscopy, or X-ray crystallography. 
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A compound identified in accordance with the methods of the invention may 
directly bind to the mRNA translation machinery. Alternatively, a compound identified in 
accordance with the methods of invention may bind to the premature stop codon. A 
compound identified in accordance with the methods of invention may also disrupt an 
5 interaction between a premature stop codon and the mRNA translation machinery. In a 
preferred embodiment, a compound identified in accordance with the methods of the 
invention suppresses premature translation termination and/or nonsense-mediated mRNA 
decay of a gene encoding a protein, polypeptide or peptide whose expression is beneficial to 
a subject. In another preferred embodiment, a compoimd identified in accordance with the 

10 methods of the invention increases premature translation termination and/or nonsense- 
mediated mRNA decay of a gene encoding a protein, polypeptide or peptide whose 
expression is detrimental to a subject. In a specific embodiment, a compound identified in 
accordance with the methods of the invention preferentially or differentially modulates 
premature translation termination and/or nonsense-mediated mRNA decay of a specific 

15 nucleotide sequence of interest relative to another nucleotide sequence, as measured by an 
assay described herein or well known to one of skill in the art under the same or similar 
assay conditions. 

In a specific embodiment, a compound identified in accordance with the 
invention suppresses premature translation termination or nonsense-mediated mRNA decay 

20 of a specific nucleotide sequence of interest by at least 5%, preferably at least 10%, at least 
15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at 
least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, 
at least 85%, at least 90%, or at least 95%, relative to an appropriate control (e.gr-, a negative 
control such as PBS), in an assay described herein under the same or similar assay 

25 conditions. In accordance with this embodiment, preferably, the compound differentially or 
preferentially suppresses the nucleotide sequence of interest relative to another nucleotide 
sequence. 

In certain embodiments of the invention, the compound identified using the 
assays described herein is a small molecule. In a preferred embodiment, the compound 
30 identified using the assays described herein is not known to affect premature translation 
termination and/or nonsense-mediated mRNA decay of a nucleic acid sequence, in 
particular a nucleic acid sequence of interest. In another preferred embodiment, the 
compound identified using the assays described herein has not been used as or suggested to 
be used in the prevention, treatment, management and/or amelioration of a disorder 
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associated with, characterized by or caused by a premature stop codon. In another preferred 
embodiment, the compound identified using the assays described herein has not been used 
as or suggested to be used in the prevention, treatment, management and/or amelioration of 
a particular disorder described herein. 

A compound identified in accordance with the methods of the invention may 
be tested in in vitro and/or in vivo assays well-known to one of skill in the art or described 
herein to determine the prophylactic or therapeutic effect of a particular compound for a 
particular disorder. In particular, a compound identified utihzing the assays described 
herein may be tested in an animal model to determine the efficacy of the compound in the 
prevention, treatment or amelioration of a disorder associated with, characterized by or 
caused by a premature stop codon, or a disorder described herein, or a symptom thereof. In 
addition, a compound identified utilizing the assays described herein may be tested for its 
toxicity in in vitro and/or in vivo assays well-known to one of skill in the art. 

The invention provides for methods for preventing, treating, managing or 
ameliorating a disorder associated with, characterized by or caused by a premature stop 
codon or a symptom thereof, said method comprising administering to a subject in need 
thereof a therapeutically or prophylactically effective amount of a compound, or a 
pharmaceutically acceptable salt thereof, identified according to the methods described 
herein. 

The present invention may be understood more fully by reference to the 
detailed description and examples, which are intended to illustrate non-limiting 
embodiments of the invention. 

3.1. Termipology 

As used herein, the term "compound" refers to any agent or complex that is 
being tested for its ability to modulate premature translation termination and/or nonsense- 
mediated mRNA decay or has been identified as modulating premature translation 
termination and/or nonsense-mediated mRNA decay. 

As used herein, the terms "disorder" and "disease" are to refer to a condition 
in a subject. In a specific embodiment, the terms disease and disorder refer to a condition in 
a subject that is associated with, characterized by, or caused by premature translation 
termination and/or nonsense-mediated mRNA decay of one or more gene products. Non- 
limiting examples of such disease and disorders are described herein below. 

As used herein, the term "effective amount" refers to the amount of a 

compound which is sufficient to (i) reduce or ameliorate the progression, severity and/or 
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duration of a disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay, or one or more symptoms thereof, (ii) 
prevent the development, recurrence or onset of a disorder associated with, characterized by 
or caused by premature translation termination and/or nonsense-mediated mRNA decay, or 
5 one or more symptoms thereof, (iii) prevent the advancement of a disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay, or one or more symptoms thereof, or (iv) enhance or improve the 
therapeutic(s) effect(s) of another therapy. 

As used herein, the term "host cell" includes a particular subject cell 

10 transfected with a nucleic acid molecule and the progeny or potential progeny of such a cell. 
Progeny of such a cell may not be identical to the parent cell transfected with the nucleic 
acid molecule due to mutations or environmental influences that may occur in succeeding 
generations or integration of the nucleic acid molecule into the host cell genome. 

As used herein, the term "in combination" refers to the nse of more than one 

15 therapy {e.g. , prophylactic and/or therapeutic agents). The use of the term "in combination" 
does not restrict the order in which therapies {e.g., prophylactic and/or therapeutic agents) 
are administered to a subject with a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay. A first therapy 
(e.^., a prophylactic or therapeutic agent such as a compound identified in accordance with 

20 the methods of the invention) can be administered prior to {e.g, 5 minutes, 15 minutes, 30 
minutes, 45 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 12 hours, 24 hours, 48 hours, 72 
hours, 96 hours, 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 8 weeks, or 12 
weeks before), concomitantly with, or subsequent to (e.g., 5 minutes, 15 minutes, 30 
minutes, 45 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 12 hours, 24 hours, 48 hours, 72 

25 hours, 96 hours, 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 8 weeks, or 12 

weeks after) the administration of a second ther^y {e.g., a prophylactic or therapeutic agent 
such as a chemotherapeutic agent or a TNF-a antagonist) to a subject with a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay. 

30 As used herein, the term "library" in the context of compounds refers to a 

plurality of compounds. A library can be a combinatorial library, e.g., a collection of 
compoimds synthesized using combinatorial chemistry techniques, or a collection of unique 
chemicals of low molecular weight (less than 1000 daltons) that each occupy a unique 
three-dimensional space. 
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As used herein, the tenns "manage", "managing" and "management" refer to 
the beneficial effects that a subject derives from a therapy (e.g^., a prophylactic or 
therapeutic agent) which does not result in a cure of the disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay. In certain embodiments, a subject is administered one or more therapies to 
"manage" a disease or disorder so as to prevent the progression or worsening of the disease 
or disorder. 

As used herein, the phrase "modulation of premature translation termination 
and/or nonsense-mediated mRNA decay" refers to the regulation of gene expression by 
altering the level of nonsense suppression. For example, if it is desirable to increase 
production of a defective protein encoded by a gene with a premature stop codon, to 
permit readthrough of the premature stop codon of the disease gene so translation of the 
gene can occur, then modulation of premature translation termination and/or nonsense- 
mediated mRNA decay entails up*regulation of nonsense suppression. Conversely, if it is 
desirable to promote the degradation of an mRNA with a premature stop codon, then 
modulation of premature translation termination and/or nonsense-mediated mRNA decays 
entails down-regulation of nonsense suppression. 

As used herein, the terms "non-responsive" and refractory" describe patients 
treated with a currently available therapy (e.g., prophylactic or therapeutic agent) for a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay (e.g., cancer), which is not clinically adequate to 
relieve one or more symptoms associated with such disorder. Typically, such patients suffer 
from severe, persistently active disease and require additional therapy to ameliorate the 
symptoms associated with their disoder. 

As used herein, "nonsense-mediated mRNA deca/' refers to any mechanism 
that mediates the decay of mRNAs containing a premature translation termination codon. 

As used herein, a "nonsense mutation" is a point mutation changing a codon 
corresponding to an amino acid to a stop codon. 

As used herein, "nonsense suppression" refers to the inhibition or 
suppression of premature translation termination and/or nonsense-mediated mRNA decay. 

The terms "nucleic acid," "nucleic acid sequence," "nucleotide sequence," 
and analagous terms as used herein include DNA molecules (e.g., cDNA or genomic DNA), 
RNA molecules {e.g., mRNA), combinations of DNA and RNA molecules of hybrid 
DNA/RNA molecules, and analogs of DNA or RNA molecules. Such analogs can be 
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generated using, for example, nucleotide analogs, which include, but are not limited to, 
inosine or tritylated bases. Such analogs can also comprise DNA or RNA molecules 
comprising modified backbones that lend beneficial attributes to the molecules, such as, for 
example, nuclease resistance or an increased ability to cross cellular membranes. The 
5 nucleic acids, nucleic acid sequences or nucleotide sequences can be single-stranded, 

double-stranded, may contain both single-stranded and double-stranded portions, and may 
contain triple-stranded portions, but prefereably is double-stranded DNA. In one 
embodiment, the nucleotide sequences comprise a contiguous open reading frame encoding 
a reporter gene, e.g,, a cDNA molecule. 

10 As used herein, the phrase "pharmaceutically acceptable salt(s)," includes, 

but is not limited to, salts of acidic or basic groups that may be present in compounds 
identified using the methods of the present invention. Compoimds that are basic in nature 
are capable of forming a wide variety of salts with various inorganic and organic acids. The 
acids that can be used to prepare pharmaceutically acceptable acid addition salts of such 

1 5 basic compounds are those that form non-toxic acid addition salts, i.e., salts containing 
pharmacologically acceptable anions, including but not limited to sulfuric, citric, maleic, 
acetic, oxalic, hydrochloride, hydrobromide, hydroiodide, nitrate, sulfate, bisulfate, 
phosphate, acid phosphate, isonicotinate, acetate, lactate, salicylate, citrate, acid citrate, 
tartrate, oleate, taimate, pantothenate, bitartrate, ascorbate, succinate, maleate, gentisinate, 

20 fiimarate, gluconate, glucaronate, saccharate, formate, benzoate, glutamate, 

methanesulfonate, ethanesulfonate, benzenesulfonate, p-toluenesulfonate and pamoate (/.^., 
l,r-methylene-bis-(2-hydroxy-3-naphthoate)) salts. Compounds that include an amino 
moiety may form pharmaceutically acceptable salts with various amino acids, in addition to 
the acids mentioned above. Compounds that are acidic in nature are capable of forming 

25 base salts with various pharmacologically acceptable cations. Examples of such salts 
include alkali metal or alkaline earth metal salts and, particularly, calcium, magnesium, 
sodium lithium, zinc, potassium, and iron salts. 

As used herein, "premature translation termination" refers to the result of a 
mutation that changes a codon corresponding to an amino acid to a stop codon. 

30 As used herein, the terms "prevent", " preventing" and "prevention" refer to 

the prevention of the development, recurrence or onset of a disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay or one or more symptoms thereof resulting from the administration of one or 
more compounds identified in accordance the methods of the invention or the 
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administration of a combination of such a compound and a knoAvn therapy for such a 
disorder. 

As used herein, the term "previously detemiined reference range" refers to a 
reference range for the readout of a particular assay. In a specific embodiment, the term 
refers to a reference range for the expression of a reporter gene and/or the activity of a 
reporter gene product by a particular cell or in a particular cell-free extract. Each laboratory 
will establish its own reference range for each particular assay, each cell type and each cell- 
free extract. In a preferred embodiment, at least one positive control and at least one 
negative control aio included in each batch of compounds analyzed. 

As used herein, the terms "prophylactic agent" and "prophylactic agents" 
refer to any agent(s) which can be used in the prevention of a disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay. In certain embodiments, the term "prophylactic agent" refers to a compound 
identified in the screening assays described herein. In certain other embodiments, the terai 
"prophylactic agent" refers to an agent other than a compound identified in the screening 
assays described herein which is known to be usefiil for, or has been or is currently being 
used to prevent or impede the onset, development and/or progression of a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay or one or more symptoms thereof. 

As used herein, the phrase "prophylactically effective amount" refers to the 
amount of a therapy {e.g., a prophylactic agent) which is sufiBcient to result in the 
prevention of the development, recurrence or onset of one or more symptoms associated 
with a disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay. 

As used herein, the term "purified" in the context of a compound, e.g.^ a compound 
identified in accordance with the method of the invention, refers to a compound that is 
substantially free of chemical precursors or other chemicals when chemically synthesized. 
In a specific embodiment, the compound is 60%, preferably 65%, 70%, 75%, 80%, 85%, 
90%, or 99% free of other, different compounds. In a preferred embodiment, a compound 
identified in accordance with the methods of the invention is purified. 

As used herein, a "premature termination codon" or "premature stop codon" 
refers to the occurrence of a stop codon instead of a codon corresponding to an amino acid. 

As used herein, a "reporter gene" refers to a gene by which modulation of 
premature translation termination and/or nonsense-mediated mRNA decay is ascertained. 
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In a preferred embodiment, the expression of a reporter gene is easily assayed and has an 
activity which is not normally found in the organism of which the translation extract is 
derived. 

As used herein, the term "small molecule" and analogous terms include, but 
are not limited to, peptides, peptidomimetics, amino acids, amino acid analogs, 
polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic or 
inorganic compounds (i.e., including heterorganic and/or ganometallic compounds) having 
a molecular weight less than about 10,000 grams per mole, organic or inorganic compounds 
having a molecular weight less than about 5,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 1,000 grams per mole, organic or 
inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 

As used herein, the terms "subject" and "patient" are used interchangeably 
herein. The terms "subject" and "subjects" refer to an animal, preferably a mammal 
including a non-primate (e.g., a cow, pig, horse, cat, dog, rat, and mouse) and a primate 
ie.g., a chimpanzee, a monkey such as a cynomolgous monkey and a human), and more 
preferably a human. In one embodiment, the subject is refractory or non-responsive to 
current therapies for a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay. In another embodiment, 
the subject is a farm animal (e.g., a horse, a cow, a pig, etc.) or a pet {e.g., a dog or a cat). 
In a preferred embodiment, the subject is a human. 

As used herein, the term "synergistic" refers to a combination of a compound 
identified using one of the methods described herein, and another therapy (e.g., a 
prophylactic or therapeutic agent), which combination is more effective than the additive 
effects of the therapies. A synergistic effect of a combination of therapies {eg., 
prophylactic or therapeutic agents) permits the use of lower dosages of one or more of the 
therapies and/or less frequent administration of said therapies to a subject with a 
proliferative disorder. The ability to utilize lower dosages of a therapy ie.g., a prophylactic 
or therapeutic agent) and/or to administer said therapy less frequently reduces the toxicity 
associated with the administration of said therapy to a subject without reducing the efficacy 
of said therapies in the prevention, treatment, management or amelioration of a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay. In addition, a synergistic effect can result in improved 
efficacy of therapies (e.g., agents) in the prevention, treatment, management or amelioration 
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of a disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay. Finally, a synergistic effect of a 
combination of therapies (e.g., prophylactic or therapeutic agents) may avoid or reduce 
adverse or unwanted side effects associated with the use of either therapy alone. 
5 As used herein, the terms "therapeutic agent" and "therapeutic agents*' refer 

to any agent(s) which can be used in the prevention, treatment, management or amelioration 
of one or more symptoms of a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay. In certain 
embodiments, the term "therapeutic agent" refers to a compound identified in the screening 

10 assays described herein. In other embodiments, the term 'therapeutic agent" refers to an 
agent other than a compound identified in the screening assays described herein which is 
known to be useful for, or has been or is currently being used to prevent, treat, manage or 
ameliorate a proliferative disorder or one or more symptoms thereof. 

As used herein, the term 'therapeutically effective amount" refers to that 

15 amount of a therapy (e.g., a therapeutic agent) sufficient to result in (i) the amelioration of 
one or more symptoms of a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay, (ii) prevent 
advancement of a disorder associated with, characterized by or caused by premature 
translation temiination and/or nonsense-mediated mRNA decay, (iii) cause regression of av 

20 disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay, or (iv) to enhance or improve the therapeutic 
efrect(s) of another therapy (e.g., therapeutic agent). 

As used herein, the terms 'treat", "treatment" and "treating" refer to the 
reduction or amelioration of the progression, severity and/or duration of a disorder 

25 associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay or one or more symptoms thereof resulting from the 
administration of one or more compounds identified in accordance the methods of the 
invention, or a combination of one or more compounds identified in accordance with the 
invention and another therapy. 

30 As used herein, the terms "therapy" and "therapies" refer to any method, 

protocol and/or agent that can be used in the prevention, treatment, management or 
amelioration of a disesase or disorder or one or more symptoms thereof. In certain 
embodiments, such terms refer to chemotherapy, radiation therapy, surgery, supportive 
therapy and/or other therapies useful in the prevention, treatment, management or 
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amelioration of a disease or disorder or one or more symptoms thereof known to skilled 
medical personnel. 

4. BRIEF DESCRIPTION OF THE FIGURES 

5 FIG. I . Translation of a wild-type luciferase RNA in the in vitro translation reaction. 

Reaction mixtures were prepared containing varying amounts of wild-type 
luciferase RNA and varying amounts of HeLa cell extract. The amount of 
luciferase produced was monitored in a Turner luminometer by the addition 
of luciferase substrate (Promega). 

1 0 FIG. 2. Translation of a nonsense containing (UGA) luciferase RNA in the in vitro 

' translation reaction. Reaction mixtures were prepared containing varying 

amounts of luciferase RNA harboring a UGA nonsense mutation and varying 
amounts of HeLa cell extract. The amount of luciferase was monitored in a 
Tumer luminometer by the addition of luciferase substrate (Promega). 

15 FIG. 3. Translation of wild-type luciferase RNA by incubating the cells on ice prior 

to lysis. HeLa cell pellets were incubated on ice or not incubated on ice prior 
to lysis and the effect of the incubation on the translation activity of the cell- 
extract was measured in an in vitro translation reaction for luciferase 
production 

20 FIG. 4 Translation of a nonsense (UGA) containing luciferase RNA in the in vitro 

^ translation reaction. Reaction mixtures were prepared with luciferase RNA 

containing a UGA nonsense mutation. Gentamicin was (GENT) or was not 
added (UNT) added to the reaction mixture and the amount of luciferase 
produced was monitored in a Viewlux luminometer by the addition of 
25 luciferase substrate (Promega). 

FIG. 5. The amount of luciferase produced was monitored in a Viewlux luminometer 

" ^ by the addition of luciferase substrate (Promega). 

FIG. 6A-6B. 6A: Nonsense suppression in cells harboring a luciferase nonsense allele. 

Stable cell lines harboring the UGA, UAA and UAG nonsense alleles of 
30 luciferase were treated overnight with Compound A, Compound B, and 

Gentamicin. The following day, the level of suppression was determined by 
measuring the amount of luminescence produced. The fold suppression 
above control cells treated with solvent was calculated and plotted vs. 
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concentration of compound. 6B: Nonsense suppression in cells harboring a 
luciferase nonsense allele. Stable cell lines harboring the UGA, UAA and 
UAG nonsense alleles of luciferase were treated overnight with Compound 
A, and gentamicin. The following day, the level of suppression was 
5 determined by measuring the amount of luminescence produced. The fold 

suppression above control cells treated with solvent was calculated and 
plotted vs. concentration of compound. 
FIG. 7A-7B. Chemical footprinting analysis of Compound A on the human 28S rRNA. 

100 pmol of ribosomes were uicubated with 100 /iM compound, followed by 

10 treatment with chemical modifying agents (dimethyl sulfate [DMS] and 

kethoxal [KB]). Following chemical modification, rRNA was prepared and 
analyzed in primer extension reactions using end-labeled oligonucleotides 
hybridizing to rRNA. Panel A (lanes 1-3 DMS modification; lanes 4-6 KE 
modification): Lanes 1 and 4, DMSO treated; 2 and 5, paromomycin treated; 

15 3 and 6, Compound A treated; 4. A sequencing reaction (indicated by lanes 

GATC in panel A) was run in parallel as a marker. 
FIG. 8. Functional CFTR expression monitored as cAMP-induced anion efflux using 

the halide-sensitive fluorophore 6-methoxy-N-(3-suIphopro-pyl) quinolinium 
(SPQ). Compound A increases cAMP-stimulated chloride channel activity 

20 in cells expressing the W1282X mutation. Cells were initially loaded in a 

hypotonic buffer containing SPQ and sodium iodide; iodide quenches SPQ 
fluorescence (Yang et al, 1993, Hum Mol Genet. 2(8): 1253-1261). Sodium 
iodide in the bath was replaced by sodium nitrate at 2 min; since nitrate does 
not interact with SPQ, fluorescence increased as cell iodide is lost to the 

25 bath. A cAMP stimulation cocktail (10 /iM forskolin, 100 {iM cpt-cAMP 

and 100 IBMX) was added at 6 min. Fluorescence was then quenched 
again by returning sodium iodide to the bath at 10 min. Functional CFTR 
expression was monitored as the dequenching of SPQ fluorescence caused 
by cAMP-induced iodide efiHux. 

30 FIG. 9. Immunohistochemistry of myotubes from primary cell culture from mdx 

' ^ muscle. The presence of dystrophin was detected by mAb to the 

COOH-terminus of dystrophin (F192A12) followed by a 
rhodamine-conjugated anti-mouse IgG. Dystrophin was present in mdx 
myotubes treated with 20 fiM Compound A (left) and in mdx myotubes 
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treated with 200 /tM gentamicin (center). Little dystrophin was detected in 
untreated mdx myotubes (right). 
FIG. 1 OA- 1 OF. Immunohistochemistry of muscle cross-sections to view dystrophin. C57 
control tibialis anterior (TA) muscle displayed positive staining for 
5 dystrophin (panel D). Muscle cross-sections from mdx mice treated with 

gentamicin (200 fiM, panel A) and Compound A (10 panel B; 20 fiM 
panel C) displayed positive staining for dystrophin. Muscle from untreated 
mdx mice (panel E) or from cross sections not treated with primary antibody 
(panel F) show only minimal staining. 

10 5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides methods for identifying compounds that 
modulate premature translation termination and/or nonsense-mediated mRNA decay. In 
particular, the invention provides simple, rapid and sensitive methods for identifying 
compounds that modulate premature translation termination and/or nonsense-mediated 

15 mRNA decay. Any gene encoding a premature stop codon can be used in the cell-based and 
cell-free assays described herein to identify compounds that modulate premature translation 
termination and/or nonsense-mediated mRNA decay. The cell-based and cell-free assays 
described herein can be utilized in a high throughput format to screen libraries of 
compounds to identify those compounds that modulate premature translation termination 

20 and/or nonsense-mediated mRNA decay. 

Reporter gene-based assays can be utilized to identify a compound that 
modulates premature translation termination and/or nonsense-mediated mRNA decay. The 
reporter gene-based assays described herein may be conducted by contacting a compound 
with a cell containing a nucleic acid sequence comprising a reporter gene, wherein said 

25 reporter gene comprises a premature stop codon, and measuring the expression and/or 
activity of the reporter gene. Alternatively, the reporter gene-based assays may be 
conducted by contacting a compound with a cell-free extract and a nucleic acid sequence 
comprising a reporter gene, wherein said reporter gene comprises a premature stop codon, 
and measuring the expression of said reporter gene. The reporter gene-based assays may 

30 also be conducted by: (a) contacting a compound with a cell containing a first nucleic acid 
sequence and a second nucleic acid sequence, wherein the first nucleic acid sequence 
comprises a reporter gene operably linked to a regulatory element and the second nucleic 
acid sequence comprises a nucleotide sequence encoding a regulatory protein or a subimit 
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thereof with a premature stop codon and the regulatory protein regulates the expression of 
the reporter gene; and (b) measuring the expression and/or activity of the reporter gene. 
Further, the reporter gene-based assays may be conducted by: (a) contacting a compound 
with a cell-free extract, a first nucleic acid sequence and a second nucleic acid sequence, 
5 wherein the first nucleic acid sequence comprises a reporter gene operably linked to a 

regulatory element and the second nucleic acid sequence comprises a nucleotide sequence 
encoding a regulatory protein or a subunit thereof with a premature stop codon and the 
regulatory protein regulates the expression of the reporter gene; and (b) measuring the 
expression and/or activity of the reporter gene. The alteration in reporter gene expression 

10 relative to a previously determined reference range, or the expression of the reporter gene in 
the absence of the compound or an appropriate control (e.g,j a negative control) in such 
reporter-gene based assays indicates that a particular compound modulates premature 
translation termination and/or nonsense-mediated mRNA decay. 

The stmcture of the compounds identified in the assays described herein that 

15 modulate changes in post-transcriptional gene regulation can be determined utilizing assays 
well-known to one of skill in the art or described herein. The methods used will depend, in 
part, on the nature of the library screened. For example, assays or microarrays of 
compounds, each having an address or identifier, may be deconvoluted, e.g., by cross- 
referencing the positive sample to an original compound list that was applied to the 

20 individual test assays. Alternatively, the structure of the compounds identified herein may 
be determined using mass spectrometry, nuclear magnetic resonance ("NMR"), X ray 
crystallography, or vibrational spectroscopy. 

The invention encompasses the use of the compounds that modulate 
premature translation termination and/or nonsense-mediated mRNA decay that were 

25 identified in accordance with the methods described herein. In particular, the invention 
encompasses the use of compounds identified as lead compounds for the development of 
prophylactic or therapeutic agents in the prevention, treatment, management and/or 
amelioration of a disease associated with, characterized by or caused by a nonsense 
mutation. Such diseases include, but are not limited to, cystic fibrosis, muscular dystrophy, 

30 heart disease, cancer, retinitis pigmentosa, collagen disorders, Tay-Sachs disease, blood 
disorders, kidney stones, ataxia-telangiectasia, lysosomal storage diseases, and tuberous 
sclerosis. 

Section 5.1 describes genes with premature translation stop codons and cells 
and cell-free extracts that are useful in the methods of the invention. Section 5.2 describes 
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libraries of compounds. Section 5.4 describes reporter gene-based assays for identifying 
compounds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay. Section 5.5 describes naturally occurring genes with premature stop codons 
and examples of diseases associated with such genes. Section 5.6 describes secondary 
5 biological screens. Section 5.7 describes the methods for designing congeners or analogs of 
compounds identified in accordance with the methods of the invention. Section 5.8 
describes uses of compounds identified in accordance with the methods of the invention for 
preventing, treating, managing or ameliorating a disease or abnormal condition in a subject 
associated with, characterized by or caused by a premature stop codon. Section 5.9 
10 describes methods of administering compounds identified in accordance with the invention 
to a subject in need thereof. 

5.1. Reporter Gene Constructs^ Transfected Cells and Cell-Free Extracts 

The invention provides for reporter genes to ascertain the effects of a 
compound on premature translation termination and/or nonsense-mediated mRNA decay. 
15 In general, the level of expression and/or activity of a reporter gene product is indicative of 
the effect of the compound on premature translation termination and/or nonsense-mediated 
mRNA decay. 

The invention provides for specific vectors comprising a reporter gene 
operably linked to one or more regulatory elements and host cells transfected with the 

20 vectors. The invention also provides for the in vitro translation of a reporter gene flanked 
by one or more regulatory elements. A reporter gene may or may not contain a premature 
stop codon depending on the assay conducted. Techniques for practicing this specific 
aspect of this invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology, microbiology, and recombinant DNA manipulation and production, 

25 which are routinely practiced by one of skill in the art. See, e.^., Sambrook, 1989, 

Molecular Cloning, A Laboratory Manual, Second Edition; DNA Cloning, Volumes I and II 
(Glover, Ed. 1985); Oligonucleotide Synthesis (Gait, Ed. 1984); Nucleic Acid Hybridization 
(Hames & Higgins, Eds. 1984); Transcription and Translation (Hames & Higgins, Eds. 
1984); Animal Cell Culture (Freshney, Ed. 1986); Immobilized Cells and Enzymes (IRL 

30 Press, 1986); Perbal, A Practical Guide to Molecular Cloning (1984); Gene Transfer 

Vectors for Mammalian Cells (Miller & Calos, Eds. 1987, Cold Spring Harbor Laboratory); 

Methods in Enzymology, Volumes 154 and 155 (Wu & Grossman, and Wu, Eds., 

respectively), (Mayer & Walker, Eds.. 1987); Immunochemical Methods in Cell and 

Molecular Biology (Academic Press, London, Scopes, 1987), Expression of Proteins in 
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Mammalian Cells Using Vaccinia Viral Vectors in Current Protocols in Molecular Biology, 
Volume 2 (Ausubel et al., Eds., 1991). 

Reporter Genes 

Any reporter gene well-known to one of skill in the art may be used in 
5 reporter gene constructs to ascertain the effect of a compound on premature translation 
termination. Reporter genes refer to a nucleotide sequence encoding a protein, polypeptide 
or peptide that is readily detectable either by its presence or activity. Reporter genes may be 
obtained and the nucleotide sequence of the elements determined by any method well- 
known to one of skill in the art. The nucleotide sequence of a reporter gene can be 

10 obtained, e.g.^ from the literature or a database such as GenBank. Altematively, a 

polynucleotide encoding a reporter gene may be generated from nucleic acid from a suitable 
source. If a clone containing a nucleic acid encoding a particular reporter gene is not 
available, but the sequence of the reporter gene is known, a nucleic acid encoding the 
reporter gene may be chemically synthesized or obtained from a suitable source (e.^., a 

15 cDNA library, or a cDNA library generated from, or nucleic acid, preferably poly A+ RNA, 
isolated from, any tissue or cells expressing the reporter gene) by PCR amplification. Once 
the nucleotide sequence of a reporter gene is determined, the nucleotide sequence of the 
" reporter gene may be manipulated using methods well-known in the art for the manipulation 
of nucleotide sequences, e.g., recombinant DNA techniques, site directed mutagenesis, 

20 PCR, etc. (see, for example, the techniques described in Sambrook et al., 1990, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY and Ausubel et aL, eds., 1998, Current Protocols in Molecular Biology, John 
Wiley & Sons, NY, which are both incorporated by reference herein in their entireties), to 
generate reporter genes having a different amino acid sequence, for example to create amino 

25 acid substitutions, deletions, and/or insertions. 

In a specific embodiment, a reporter gene is any naturally-occurring gene 
with a premature stop codon. Genes with premature stop codons that are useful in the 
present invention include, but are not limited to, the genes described below. In an 
alternative embodiment, a reporter gene is any gene that is not known in nature to contain a 

30 premature stop codon. Examples of reporter genes include, but are not limited to, luciferase 
(e.g., firefly luciferase, renilla luciferase, and click beetle luciferase), green fluorescent 
protein ("GFP") (e.g., green fluorescent protein, yellow fluorescent protein, red fluorescent 
protein, cyan fluorescent protein, and blue fluorescent protein), beta-galactosidase ("beta- 
gal"), beta-glucoronidase, beta-lactamase, chloramphenicol acetyltransferase ("CAT'), and 
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alkaline phosphatase ("AP"). Alternatively, a reporter gene can also be a protein tag, such 
as, but not limited to, myc. His, FLAG, or GST, so that nonsense suppression will produce 
the peptide and the protein can be monitored by an ELIS A, a western blot, or any other 
immunoassay to detect the protein tag. Such methods are well known to one of skill in the 
art. In a preferred embodiment, the reporter gene is easily assayed and has an activity 
which is not normally found in the gene of interest. Table 1 below lists various reporter 
genes and the properties of the products of the reporter genes that can be assayed. In a 
preferred embodiment, a reporter gene utilized in the reporter constructs is easily assayed 
and has an activity which is not normally found in the cell or organism of interest. 



TABLE 1 : Reporter Genes and the Properties of the Reporter Gene Products 



Reporter Gene 


Protein Activity & Measurement 


CAT (chloramphenicol acetyltransferase) 


Transfers radioactive acetyl groups to 
chloramphenicol or detection by thin layer 
chromatography and autoradiography 


GAL (beta-galactosidase) 


Hydrolyzes colorless galactosides to yield 
colored products. 


GUS (beta-glucuronidase) 


Hydrolyzes colorless glucuronides to yield 
colored products. 


LUC (luciferase) 


Oxidizes luciferin, emitting photons 


GFP (green fluorescent protein) 


Fluorescent protein without substrate 


SEAP (secreted alkaline phosphatase) 


Luminescence reaction with suitable substrates 
or with substrates that generate chromophores 


HRP (horseradish peroxidase) 


In the presence of hydrogen oxide, oxidation of 
3,3',5,5*-tetramethylbenzidine to form a colored 
complex 


AP (alkaline phosphatase) 


Luminescence reaction with suitable substrates 
or with substrates that generate chromophores 



Described hereinbelow in further detailed are specific reporter genes and 



characteristics of those reporter genes. 

5.U.I. Luciferase 

Luciferases are enzymes that emit light in the presence of oxygen and a 
substrate (luciferin) and which have been used for real-time, low-light imaging of gene 
expression in cell cultures, individual cells, whole organisms, and transgenic organisms 
(reviewed by Greer & Szalay, 2002, Luminescence 1 7(l):43-74). 
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As used herein, the term "luciferase" is intended to embrace all luciferases, 
or recombinant enzymes derived from luciferases which have luciferase activity. The 
luciferase genes from fireflies have been v^^ell characterized, for example, from the Photinus 
and Luciola species (see, e.g.. International Patent Publication No. WO 95/25798 for 
Photinus pyralis^ European Patent Application No. EP 0 524 448 for Luciola cruciata and 
Luciola lateralis^ and Devine et al., 1993, Biochim. Biophys. Acta 1 173(2):121-132 for 
Luciola mingrelica). Other eucaryotic luciferase genes include, but are not limited to, the 
click beetle {Photinus plagiophthalamus, see, e.g.. Wood et aL, 1989, Science 244:700- 
702), the sea panzy (Renilla reniformis, see, e,g., Lorenz et al., 1991, Proc Natl Acad Sci U 
S A 88(10):443 8-4442), and the glow worm (Lampyris noctiluca, see e.g, Sula-Newby et 
aL, 1996, Biochem J. 313:761-767). The click beetle is unusual in that different members 
of the species emit bioluminescence of different colors, which emit light at 546 ran (green), 
560 ran (yellow-green), 578 ran (yellow) and 593 nm (orange) (see, e.g. U.S. Patent Nos. 
6,475,719; 6,342,379; and 6,217,847, the disclosures of which are incorporated by reference 
in their entireties). Bacterial luciferin-luciferase systems include, but are not limited to, the 
bacterial lux genes of terrestrial Photorhabdus luminescens (see, e,g^ Manukhov et aL, 
2000, Genetika 36(3):322-30) and marine bacteria Vibrio fischeri and Vibrio harveyi (see, 
eg., Miyamoto et al., 1988, J Biol Chem. 263(26): 13393-9, and Cohn et aL, 1983, Proc Natl 
Acad Sci USA., 80(1): 120-3, respectively). The luciferases encompassed by the present 
invention also includes the mutant luciferases described in U.S. Patent No. 6,265,177 to 
Squirrell et aL, which is hereby incorporated by reference in its entirety. 

In a specific embodiment, the luciferase is a firefly luciferase, a renilla 
luciferase, or a click beetle luciferase, as described in any one of the references listed supra, 
the disclosures of which are incorporated by reference in their entireties. 

5.1*1.2. Green Fluorescent Protein 

Green fluorescent protein ("GFP") is a 238 amino acid protein with amino 
acid residues 65 to 67 involved in the formation of the chromophore which does not require 
additional substrates or cofactors to fluoresce (see, e.g., Prasher et aL, 1992, Gene 1 1 1:229- 
233; Yang et al., 1996, Nature Biotechnol. 14:1252-1256; and Cody et aL, 1993, 
Biochemistry 32: 1 2 1 2- 1 2 1 8). 

As used herein, the term "green fluorescent protein" or "GFP" is intended to 

embrace all GFPs (including the various forms of GFPs which exhibit colors other than 

green), or recombinant enzymes derived from GFPs which have GFP activity. In a 

preferred embodiment, GFP includes green fluorescent protein, yellow fluorescent protein, 
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red fluorescent protein, cyan fluorescent protein, and blue fluorescent protein. The native 
gene for GFP was cloned from the bioluminescent jellyfish i4e9i*orca victoria (see, e.g., 
Morin et al., 1972, J. Cell Physiol. 77:313-318). Wild type GFP has a major excitation peak 
at 395 nm and a minor excitation peak at 470 nm. The absorption peak at 470 nm allows 

5 the monitoring of GFP levels using standard fluorescein isothiocyanate (FITC) filter sets. 
Mutants of the GFP gene have been found usefiil to enhance expression and to modify 
excitation and fluorescence. For example, mutant GFPs with alanine, glycine, isoleucine, or 
threonine substituted for serine at position 65 result in mutant GFPs with shifts in excitation 
maxima and greater fluorescence than wild type protein when excited at 488 nm (see, e.g., 

10 Heim et al., 1995, Nature 373:663-664; U.S. Patent No. 5,625,048; Delagrave et al., 1 995, 
Biotechnology 13:151-154; Cormack et al., 1996, Gene 173:33-38; and Cramer et al., 1996, 
Nattire Biotechnol. 14:315-319). The ability to excite GFP at 488 nm peraaits the use of 
GFP with standard fluorescence activated cell sorting ("FACS") equipment. In another 
embodiment, GFPs are isolated from organisms other than the jellyfish, such as, but not 

15 limited to, the sea pansy, Renilla reriformis. 

Techniques for labeling cells with GFP in general are described in U.S. 
Patent Nos. 5,491,084 and 5,804,387, which are incorporated by reference in their 
entireties; Chalfie et al., 1994, Science 263:802-805; Heim et al., 1994, Proc. Nati. Acad. 
Sci. USA 91:12501-12504; Morise et al., 1974, Biochemistiry 13:2656-2662; Ward et al., 

20 1980, Photochem. Photobiol. 31:61 1-615; Rizzuto et al., 1995, Curr. Biology 5:635-642; 
and Kaether & Gerdes, 1995, FEBS Lett 369:267-271. The expression of GFPs in ^. coli 
and C. elegans are described in U.S. Patent No. 6,251,384 to Tan et al., which is 
incorporated by reference in its entirety. The expression of GFP in plant cells is discussed 
in Hu & Cheng, 1995, FEBS Lett 369:331-33, and GFP expression in Drosophila is 

25 described in Davis et al., 1995, Dev. Biology 170:726-729. 

5.1.1.3. Beta Galactosidase 

Beta galactosidase ("beta-gal") is an enzyme that catalyzes the hydrolysis of 
beta-galactosides, including lactose, and the galactoside analogs o-nitrophenyl-beta-D- 
galactopyranoside ("ONPG") and chlorophenol red-beta-D-galactopyranoside ("CPRG") 

30 (see, e.g., Nielsen et al., 1983 Proc Natl Acad Sci USA 80(17):51 98-5202; Eustice et al., 
1991, Biotechniques 1 1:739-742; and Henderson et al., 1986, Clin. Chem. 32:1637-1641). 
The beta-gal gene functions well as a reporter gene because the protein product is extremely 
stable, resistant to proteolytic degradation in cellular lysates, and easily assayed. When 
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ONPG is used as the substrate, beta-gal activity can be quantitated with a spectrophotometer 
or microplate reader. 

As used herein, the term "beta galactosidase" or "beta-gal" is intended to 
embrace all beta-gals, including lacZ gene products, or recombinant enzymes derived from 
beta-gals which have beta-gal activity. The beta-gal gene functions well as a reporter gene 
because the protein product is extremely stable, resistant to proteolytic degradation in 
cellular lysates, and easily assayed. In an embodiment where ONPG is the substrate, beta- 
gal activity can be quantitated with a spectrophotometer or microplate reader to detemiine 
the amount of ONPG converted at 420 nm. In an embodiment when CPRG is the substrate, 
beta-gal activity can be quantitated with a spectrophotometer or microplate reader to 
detemiine the amount of CPRG converted at 570 to 595 nm. In yet another embodiment, 
the beta-gal activity can be visually ascertained by plating bacterial cells traxisfomied with a 
beta-gal constmct onto plates containing Xgal and IPTG. Bacterial colonies that are dark 
blue indicate the presence of high beta-gal activity and colonies that are varying shades of 
blue indicate varying levels of beta-gal activity. 

5.I.I.4. Beta-Glucuropidase 

Beta-glucuronidase ("GUS") catalyzes the hydrolysis of a very wide variety 
of beta-glucuronides, and, with much lower efficiency, hydrolyzes some beta- 
galacturonides. GUS is very stable, will tolerate many detergents and widely varying ionic 
conditions, has no cofactors, nor any ionic requirements, can be assayed at any 
physiological pH, with an optimum between 5.0 and 7.8, and is reasonably resistant to 
thermal inactivation (see, e.g.^ U.S. Patent No. 5,268,463, which is incorporated by 
reference in its entirety). 

In one embodiment, the GUS is derived from the Esherichia coli beta- 
glucuronidase gene. In alternate embodiments of the invention, the beta-glucuronidase 
encoding nucleic acid is homologous to the E. coli beta-glucuronidase gene and/or may be 
derived from another organism or species. 

GUS activity can be assayed either by fluorescence or spectrometry, or any 

other method described in U.S. Patent No. 5,268,463, the disclosure of which is 

incorporated by reference in its entirety. For a fluorescent assay, 4- 

trifluoromethylumbelliferyl beta-D-glucuronide is a very sensitive substrate for GUS. The 

fluorescence maximum is close to 500 nm— bluish green, where very few plant compounds 

fluoresce or absorb. 4-trifluoromethylumbeIliferyl beta-D-glucuronide also fluoresces much 

more strongly near neutral pH, allowing continuous assays to be perforaied more readily 
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than with MUG. 4-trifluoromethylumbelliferyl beta-D-glucuronide can be used as a 
fluorescent indicator in vivo. The spectrophotometric assay is very straightforward and 
moderately sensitive (Jefferson et al., 1986, Proc. Natl. Acad. Sci. USA 86:8447-8451). A 
preferred substrate for spectrophotometric measurement is p-nitrophenyl beta-D- 
5 glucuronide, which when cleaved by GUS releases the chromophore p-nitrophenol. At a pH 
greater than its pKa (around 7.15) the ionized chromophore absorbs light at 400-420 nm, 
giving a yellow color 

5.1.1.5. Beta-Lactamases 

Beta-lactamases are nearly optimal enzymes in respect to their almost 

10 diffusion-controlled catalysis of beta-lactam hydrolysis, making them suited to the task of 
an intracellular reporter enzyme (see, e.g.^ Christensen et al., 1990, Biochem. J. 266: 853- 
861). They cleave the beta-lactam ring of beta-lactam antibiotics, such as penicillins and 
cephalosporins, generating new charged moieties in the process (see, e.g.y O'Callaghan et 
al., 1968, Antimicrob. Agents. Chemother. 8: 57-63 and Stratton, 1988, J. Antimicrob. 

15 Chemother. 22, Suppl. A: 23-35). A large number of beta-lactamases have been isolated 
and characterized, all of which would be suitable for use in accordance with the present 
invention (see, e.g., Richmond & Sykes, 1978, Adv.Microb.Physiol. 9:31-88 and Ambler, 
1980, Phil. Trans. R. Soc. Lond. [Ser.B.] 289: 321-331, the disclosures of which are 
incorporated by reference in their entireties). 

20 The coding region of an exemplary beta-lactamase employed has been 

described in U.S. Patent No. 6,472,205, Kadonaga et al., 1984, J.Biol.Chem. 259: 2149- 
2154, and Sutcliffe, 1978, Proc. Natl. Acad. Sci. USA 75: 3737-3741, the disclosures of 
which re incorporated by reference in their entireties. As would be readily apparent to those 
skilled in the field, this and other comparable sequences for peptides having beta-lactamase 

25 activity would be equally suitable for use in accordance with the present invention. The 
combination of a fluorogenic substrate described in U.S. Patent Nos. 6,472,205, 5,955,604, 
and 5,741,657, the disclosures of which are incorporated by reference in their entireties, and 
a suitable beta-lactamase can be employed in a wide variety of different assay systems, such 
as are described in U.S. Patent No. 4,740,459, which is hereby incorporated by reference in 

30 its entirety. 

5.1.1.6. Chloramphenicol Acetvltransferase 

Chloramphenicol acetyl transferase ("CAT*) is commonly used as a reporter 
gene in manmialian cell systems because manunalian cells do not have detectable levels of 
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CAT activity. The assay for CAT involves incubating cellular extracts with radiolabeled 
chloramphenicol and appropriate co-factors, separating the starting materials from the 
product by, for example, thin layer chromatography ("TLC'O, followed by scintillation 
counting (see, e.^,, U.S. Patent No. 5,726,041, which is hereby incorporated by reference in 
its entirety). 

As used herein, the term "chloramphenicol acetyltransferase" or "CAT" is 
intended to embrace all CATs, or recombinant enzymes derived from CAT which have 
CAT activity. While it is preferable that a reporter system which does not require cell 
processing, radioisotopes, and chromatographic separations would be more amenable to 
high through-put screening, CAT as a reporter gene may be preferable in situations when 
stability of the reporter gene is important. For example, the CAT reporter protein has an in 
vivo half life of about SO hours, which is advantageous when an accumulative versus a 
dynamic change type of result is desired. 

5.1. 1.7. Secreted Alkaline Phosphatase 

The secreted alkaline phosphatase ("SEAP") enzyme is a truncated form of 
alkaline phosphatase, in which the cleavage of the transmembrane domain of the protein 
allows it to be secreted from the cells into the surrounding media. In a preferred 
embodiment, the alkaline phosphatase is isolated from human placenta. 

As used herein, the term "secreted alkaline phosphatase" or "SEAP" is 
intended to embrace all SEAP or recombinant enzymes derived from SEAP which have 
alkaline phosphatase activity. SEAP activity can be detected by a variety of methods 
including, but not limited to, measurement of catalysis of a fluorescent substrate, 
immunoprecipitation, HPLC, and radiometric detection. The luminescent method is 
preferred due to its increased sensitivity over calorimetric detection methods. The 
advantages of using SEAP is that a cell lysis step is not required since the SEAP protein is 
secreted out of the cell, which facilitates the automation of sampling and assay procedures. 
A cell-based assay using SEAP for use in cell-based assessment of inhibitors of the 
Hepatitis C virus protease is described in U.S. Patent No. 6,280,940 to Potts et al. which is 
hereby incorporated by reference in its entirety. 

5.1,2. Proteins That Regulate the Expression of Reporter Genes 

The invention provides a nucleic acid sequence comprising a nucleotide 
sequence encoding a regulatory protein or a component or subunit thereof, which regulatory 
protein binds to a regulatory element operably linked to a reporter gene and regulates the 
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expression of the reporter gene. The expression of the full-length regulatory protein or 
component or subunit thereof is suppressed or inhibited in the absence of a compound that 
suppresses premature translation termination and/or nonsense-mediated mRNA decay 
because of the presence of a premature stop codon or nonsense mutation within the open 
5 reading frame of the nucleotide sequence encoding the regulatory protein. The expression 
of the full-length regulatory protein or component or subunit thereof is, thus, contingent on 
the suppression of the premature stop codon or nonsense mutation by a compound. As the 
expression of the reporter gene is regulated by a regulatory element responsive to the full- 
length regulatory protein, reporter gene expression should only be detected in the presence 

10 of a compound that suppresses the premature stop codon or nonsense mutation. 

The location of the premature stop codon or nonsense mutation is N-tenninal 
to the native stop codon of the regulatory protein or component or subunit thereof In a 
specific embodiment, the premature stop codon or nonsense mutation is at least 15 
nucleotides, preferably at least 25 nucleotides, at least 50 nucleotides, at least 75 nucleotides 

15 or at least 100 nucleotides from the start codon in the open reading frame of the nucleotide 
sequence encoding the regulatory protein or a component or subunit thereof In another 
embodiment, the premature stop codon or nonsense mutation is at least 15 nucleotides, 
preferably at least 25 nucleotides, at least 50 nucleotides, at leat 75 nucleotides or at least 
100 nucleotides from the native stop codon in the open reading frame of the nucleotide 

20 sequence encoding the regulatory protein or a component or subunit thereof. In another 
embodiment, the premature stop codon in the open reading frame of the nucleotide 
sequences. In another embodiment, the premature stop codon in the open reading frame of 
the nucleotide sequence encoding the regulatory protein or a component or subunit thereof 
is in the context of UGAA, UGAC, UGAG, UGAU, UAGA, UAGC, UAGG, UAGU, 

25 UAAA, UAAC, UAAG or UAAU. In yet another embodiment, the nucleotide sequence 

encoding the regulatory protein or a component or subunit thereof, contains or is engineered 
to contain two, three, four or more stop codons. In another embodiment, the premature stop 
codon in the open reading frame of the nucleotide sequences encoding the regulatory 
protein or a component or subunit thereof is UAG or UGA. 

30 In one embodiment, the invention provides a nucleic acid sequence 

comprising a nucleotide sequence encoding a regulatory protein with a premature stop 
codon or nonsense mutation. In accordance with this embodiment, the nucleic acid 
sequence can encode a naturally-occurring gene with a premature stop codon or nonsense 
mutation or the nucleic acid sequence can be engineered to contain a premature stop codon 
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or nonsense mutation using techniques well-known in the art. In this case, the expression of 
the full-length regulatory protein regulates the expression of the reporter gene which is 
detected by techniques well-known in the art or described herein. 

In another embodiment, the invention provides a first nucleic acid sequence 
and a second nucleic acid sequence, wherein the first nucleic acid sequence comprises a 
nucleotide sequence encoding a first fusion protein comprising (or alternatively, consisting 
of) a DNA binding domain and a first protein, and the second nucleic acid sequence 
comprises a nucleotide sequence encoding a second fusion protein comprising (or 
alternatively, consisting of) an activation domain and a second protein. In accordance with 
this embodiment, the nucleotide sequence encoding the first or second fusion protein may 
contain or be engineered to contain a premature stop codon or nonsense mutation. The first 
fusion protein and second fusion protein interact and produce a regulatory protein when the 
premature stop codon or nonsense mutation is suppressed by a compound. Thus, the 
production of a functional regulatory protein is dependent on suppression of a premature 
stop codon or nonsense mutation. In this case, the production of the functional regulatory 
protein regulates the expression of the reporter gene which is detected by techniques well- 
known in the art or described herein. 

In one embodiment of the invention, the protein that regulates expression of 
a gene contains domains which are associated with various activities related to 
transcriptional regulation, including, but not limited to, binding and activation. In one 
embodiment, a binding domain of a regulatory protein is one that recognizes and 
specifically associates with a sequence of at least two nucleotides of a nucleic acid. Nucleic 
acids that can be recognized by a binding domain of a protein, include, but are not limited 
to, DNA and RNA both single and multiple stranded. In a more specific embodiment, a 
binding domain can adopt one of a number of conformations or motifs, known in the art, 
including but not limited to, zinc finger, leucine zipper, helix turn helix and helix loop helix. 
In a more preferred embodiment, the binding domain protein is one that specifically 
recognizes a region of a nucleic acid. Such recognition can occur through a number of 
interactions, including, but not limited to, covalent, hyrophobic and van der Waals. In 
another embodiment, an activation domain is one that modulates, regulates, enhances, 
suppresses or controls the expression of a gene. In such an embodiment, the activation 
domain can modulate, regulate, enhance, suppress or control the expression of a gene by 
interacting, either directly or indirectly, with other compounds or proteins that are required 
or involved in gene expression. In one embodiment, such domains can be expressed as 
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proteins that are fused with other proteins suitable for the described assays. For example, in 
one embodiment, the activation domain of a regulatory protein is expressed as part of a 
protein or polypeptide encoded by a nucleic acid, and the binding domain of a regulatory 
protein is expressed as a part of a protein or polypeptide encoded by another nucleic acid. 
5 In a more specific embodiment, such binding and regulatory domains are expressed as 
fusion proteins with other proteins with properties that are suitable to the assay. In an 
example of an embodiment suitable to the described assays, the binding and regulatory 
domains are expressed on separate fusion proteins with proteins that interact with each 
other. For example, the binding domain can be expressed as a chimeric protein that is fused 

10 to another protein known to associate with another protein that is expressed from a separate 
nucleic acid and fused to the activation domain. In such an embodiment, a regulatory 
complex is formed by the association between the binding domain and the activation 
domain expressed as parts of the described fiision proteins. Interaction between the two 
domains can be mediated or initiated by a number of means, preferably through inter or 

15 intra molecular associations between the parts of the described fusion proteins that are 
known to interact with one another. Examples of proteins or complexes that contain 
domains that bind to nucleic acids in addition to possesing regulatory functions include, but 
are not limited to, GAL4, glucocorticoid and estrogen receptors (GR and ER), Xfin protein, 
GCN4, and the transcription factor Max in complex with oncogene Myc. 

20 The invention relates to the identification of compounds that modulate 

premature translation termination or nonsense-mediated mRNA decay, using, in some 
instances, a reporter based assay. The invention provides for the identification of 
compounds that modulated premature translation termination via a nonsense stop codon in a 
nucleic acids. Such nucleic acids include, but are not limited to, DNA and RNA. In a more 

25 certain embodiment, the nucleic acid is RNA. In another embodiment, the nucleic acid is 
single stranded. In other embodiments, the nucleic acids are single stranded. In yet other 
embodiments, the nucleic acids are more than single stranded, e.g., double, triple or 
quadruple stranded. 

5.1.3. Stop Codons 

30 The present invention provides for methods for screening and identifying 

compounds that modulate premature translation termination and/or nonsense-mediated 

mRNA decay. A reporter gene may be engineered to contain a premature stop codon or 

may naturally contain a premature stop codon. Alternatively, a protein, polypeptide or 

peptide that regulates (directly or indirectly) the expression of a reporter gene may be 
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engineered to contain or may naturally contain a premature stop codon. The premature stop 
codon may any one of the stop codons known in the art including UAG, UAA and UGA. 

The stop codons are UAG, UAA, and UGA, signals to the ribosome to 
temiinate protein synthesis, presumably through protein release factors. Even though the 
use of these stop codons is widespread, they are not universal. For example, UGA specifies 
tryptophan in the mitochondria of mammals, yeast, Neurospora crassa, Drosophila, 
protozoa, and plants (see, e.g.^ Breitenberger & RajBhandary, 1985, Trends Biochem Sci 
10:481). Other examples include the use of UGA for tryptophan in Mycoplasma and, in 
ciliated protozoa, the use of UAA and UAG for glutamine (see, e.g.. Jukes et aL, 1987, Cold 
Spring Harb Symp Quant Biol. 52:769-776), the use of UGA for cysteine in the ciliate 
Euplotes aediculatus (see, eg., Kervestin et al., 2001, EMBO Rep 2001 Aug;2(8):680-684), 
the use of UGA for tryptophan in Blepharisma americanum and the use of UAR for 
glutamine in Tetrahymena, and three spirotrichs, Stylonychia lenmae, S. mytilus, and 
Oxytricha trifallax (see, e.g. , Lozupone et al., 2001 , Curr Biol 1 1 (2):65-74). It has been 
proposed that the ancestral mitochondrion was bearing the universal genetic code and 
subsequently reassigned the UGA codon to tryptophan independently, at least in the lineage 
of ciliates, kinetoplastids, rhodophytes, prymnesiophytes, and fimgi (see, e.g., Inagaki et al., 
1998, J Mol Evol 47(4):378-384). 

The readthrough of stop codons also occurs in positive-sense ssRNA viruses 
by a variety of naturally occurring suppressor tRNAs. Such naturally-occurring suppressor 
tRNAs include, but are not limited to, cytoplasmic tRNATyr, which reads through the UAG 
stop codon; cytoplasmic tRNAsGhi, which read through UAG and UAA; cytoplasmic 
tRNAsLeu, which read through UAG; chloroplast and cytoplasmic tRNAsTrp, which read 
through UGA; chloroplast and cytoplasmic tRNAsCys, which read through UGA; 
cytoplasmic tRNAsArg, which read through UGA (see, e.g., Beier 8l Grimm, 2001, Nucl 
Acids Res 29(23):4767-4782 for a review); and the use of selenocysteme to suppress UGA 
in E. coli (see, e.g.. Baron & Bock, 1995, The selenocysteine inserting tRNA species: 
structure and function. In S6ll,D. and RajBhandary,U.L. (eds), tRNA: Structure, 
Biosynthesis and Function, ASM Press, Washington, DC, pp. 529 544). The mechanism is 
thought to involve unconventional base interactions and/or codon context effects. 

As described above, the stop codons are not necessarily universal, with 
consideration variation amongst organelles {e.g.^ mitochondria and chloroplasts), viruses 
{e.g., single strand viruses), and protozoa {e.g., ciliated protozoa) as to whether the codons 
UAG, UAA, and UGA signal translation teraiination or encode amino acids. Even though a 
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single release factor most probably recognizes all of the stop codons in eucaryotes, it 
appears that all of the stop codons are not suppressed in a similar matter. For example, in 
the yeast Saccharomyces pombCy nonsense suppression has to be strictly codon specific 
(see, e.g., Hottinger et al., 1984, EMBO J 3:423-428). In another example, significant 
5 differences were found in the degree of suppression amongst three UAG codons and two 
UAA codons in different mRNA contexts in Escherichia coli and in human 293 cells, 
although data suggested that the context effects of nonsense suppression operated 
differently in E, coli and human cells (see, e.g., Martin et aL, 1989, Mol Gen Genet 217(2 
3):41 1 8). Since unconventional base interactions and/or codon context effects have been 

10 implicated in nonsense suppression, it is conceivable that compounds involved in nonsense 
suppression of one stop codon may not necessarily be involved in nonsense suppression of 
another stop codon. In other words, compounds involved in suppressing UAG codons may 
not necessarily be involved in suppressing UGA codons. 

In a specific embodiment, a reporter gene or a gene encoding a protein, 

1 5 polypeptide or peptide that regulates the expression of a reporter gene contains or is 

engineered to contain the premature stop codon UAG. In another embodiment, a reporter 
gene or a gene encoding a protein, polypeptide or peptide that regulates the expression of a 
reporter gene contains or is engineered to contain the premature stop codon UGA. In yet 
another embodiment, a reporter gene or a gene encoding a protein, polypeptide or peptide 

20 that regulates the expression of a reporter gene contains or is engineered to contain a 

premature stop codon in the context of UGAA, UGAC, UGAG, UGAU, UAGA, UAGC, 
UAGG, UAGU, UAAA, UAAC, UAAG or UAAU. 

In a particular embodiment, a reporter gene or a gene encoding a protein, 
polypeptide or peptide that regulates the expression of a reporter gene contains or is 

25 engineered to contain two, three, four or more stop codons. In accordance with this 

embodiment, the stop codons are preferably at least 10 nucleotides, at least 15 nucleotides, 
at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 
nucleotides, at least 40 nucleotides, at least 45 nucleotides, at least 50 nucleotides, at least 
75 nucleotides or at least 100 nucleotides apart from each other. Further, in accordance 

30 with this embodiment, at least one of the stop codons is preferably UAG or UGA. 

In a specific embodiment, a reporter gene or a gene encoding a protein, 
polypeptide or peptide that regulates the expression of a reporter gene contains or is 
engineered to contain a premature stop codon at least 15 nucleotides, preferably at least 20 
nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 
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40 nucleotides, at least 45 nucleotides, at least SO nucleotides or at least 75 nucleotides from 
the start codon in the coding sequence. In another embodiment, a reporter gene or a gene 
encoding a protein, polypeptide or peptide that regulates the expression of a reporter gene 
contains or is engineered to contain a premature stop codon at least 15 nucleotides, 
preferably at least 25 nucleotides, at least 50 nucleotides, at least 75 nucleotides, at least 100 
nucleotides, at least 125 nucleotides, at least 150, at least 175 nucleotides or at least 200 
nucleotides from the native stop codon in the coding sequence of the full-length reporter 
gene product or protein, polypeptide or peptide. In another embodiment, a reporter gene or 
a gene encoding a protein, polypeptide or peptide that regulates the expression of a reporter 
gene contains or is engineered to contain a premature stop codon at least 15 nucleotides 
(preferably at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 
nucleotides, at least 40 nucleotides, at least 45 nucleotides, at least 50 nucleotides or at least 
75 nucleotides) from the start codon in the coding sequence and at least 15 nucleotides 
(preferably at least 25 nucleotides, at least 50 nucleotides, at least 75 nucleotides, at least 
100 nucleotides, at least 125 nucleotides, at least 150, at least 175 nucleotides or at least 200 
nucleotides) from the native stop codon in the coding sequence of the full-length reporter 
gene product or protein, polypeptide or peptide. In accordance with these embodiments, the 
premature stop codon is preferably UAG or UGA. 

The premature translation stop codon can be produced by in vitro mutagenesis 
techniques such as, but not limited to, polymerase chain reaction ("PGR"), linker insertion, 
oligonucleotide-mediated mutagenesis, and random chemical mutagenesis. 

5.1.4. Vectors 

The nucleotide sequence encoding for a protein, polypeptide or peptide (e.^., 
a reporter gene, or a protein, polypeptide or peptide that regulates the expression of a 
reporter gene) can be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for the transcription and translation of the inserted protein- 
coding sequence. The necessary transcriptional and translational elements can also be 
supplied by the protein, polypeptide or peptide. The regulatory regions and enhancer 
elements can be of a variety of origins, both natural and synthetic. In a specific 
embodiment, a reporter gene is operably linked to regulatory element that is responsive to a 
regulatory protein whose expression is dependent upon the suppression of a premature stop 
codon. 

A variety of host-vector systems may be utilized to express a protein, 

polypeptide or peptide. These include, but are not limited to, mammalian cell systems 
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infected with virus (e,g.y vaccinia virus, adenovirus, etc.); insect cell systems infected with 
virus (e.^., baculovirus); microorganisms such as yeast containing yeast vectors, or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA; and stable cell lines 
generated by transformation using a selectable marker. The expression elements of vectors 
5 vary in their strengths and specificities. Depending on the host-vector system utilized, any 
one of a number of suitable transcription and translation elements may be used. 

Any of the methods previously described for the insertion of DNA fragments 
into a vector may be used to construct expression vectors containing a chimeric nucleic acid 
consisting of appropriate transcriptional/translational control signals and the protein coding 

10 sequences. These methods may include in vitro recombinant DNA and synthetic techniques 
and in vivo recombinants (genetic recombination). Expression of a first nucleic acid 
sequence encoding a protein, polypeptide or peptide, such as a reporter gene, may be 
regulated by a second nucleic acid sequence so that the first nucleic acid sequence is 
expressed in a host transformed with the second nucleic acid sequence. For example, 

15 expression of a nucleic acid sequence encoding a protein, polypeptide or peptide, such as a 
reporter gene, may be controlled by any promoter/enhancer element known in the art, such 
as a constitutive promoter, a tissue-specific promoter, or an inducible promoter. Specific 
examples of promoters which may be used to control gene expression include, but are not 
limited to, the SV40 early promoter region (Bemoist & Chambon, 1981, Nature 290:304- 

20 310), the promoter contained in the 3 ' long terminal repeat of Rous sarcoma virus 

(Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner 
et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 296:39-42); prokaryotic expression 
vectors such as the jS-lactamase promoter (Villa-Kamaroff et al., 1978, Proc. Natl. Acad. 

25 Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer et al., 1983, Proc. Natl. Acad. Sci. 
U.S.A. 80:21-25); see also 'TJsefiil proteins from recombinant bacteria" in Scientific 
American, 1980, 242:74-94; plant expression vectors comprising the nopaline synthetase 
promoter region (Herrera-Estrella et al.. Nature 303:209-213) or the cauliflower mosaic 
virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 9:2871), and the promoter 

30 of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella et al., 

1984, Nature 310:1 15-120); promoter elements from yeast or other fungi such as the Gal 4 
promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control 
regions, which exhibit tissue specificity and have been utilized in transgenic animals: 
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elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-646; Omitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 
pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control 
5 region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 

10 control region which is active in liver (Knunlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; 
Hammer et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
KoUias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active 

15 in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin light 
chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 314:283- 
286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et aL, 1986, Science 234:1372-1378). 

In a specific embodiment, a vector is used that comprises a promoter 

20 operably linked to a reporter gene, one or more origins of replication, and, optionally, one or 
more selectable markers (e.g.j an antibiotic resistance gene). In a preferred embodiment, 
the vectors are CMV vectors, T7 vectors, lac vectors, pCEP4 vectors, 5.0/F vectors, or 
vectors with a tetracycline-regulated promoter (e.g., pcDNA^5/FRT/TO from Invitrogen). 
Some vectors may be obtained commercially. Non-limiting examples of useful vectors are 

25 described in Appendix 5 of Current Protocols in Molecular Biology, 1988, ed. Ausubel et 
al., Greene Publish. Assoc. & Wiley Interscience, which is incorporated herein by reference; 
and the catalogs of commercial suppliers such as Clontech Laboratories, Stratagene Inc., 
and Invitrogen, Inc. 

Expression vectors containing a construct of the present invention can be 

30 identified by the following general approaches: (a) nucleic acid hybridization, (b) presence 
or absence of "marker** nucleic acid fimctions, (c) expression of inserted sequences, and (d) 
sequencing. In the first approach, the presence of a particular nucleic acid sequence 
inserted in an expression vector can be detected by nucleic acid hybridization using probes 
comprising sequences that are homologous to the inserted nucleic acid sequence. In the 

-41 - 



wo 2004/001010 PCT/US2003/019760 

second approach, the recombinant vector/host system can be identified and selected based 
upon the presence or absence of certain "marker" nucleic acid functions (e.g., thymidine 
kinase activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of the nucleic acid sequence of 
5 interest in the vector. For example, if the nucleic acid sequence of interest is inserted within 
the marker nucleic acid sequence of the vector, recombinants containing the insert can be 
identified by the absence of the marker nucleic acid function. In the third approach, 
recombinant expression vectors can be identified by assaying the product expressed by the 
recombinant. Such assays can be based, for example, on the physical or fimctional 

10 properties of the particular nucleic acid sequence. 

In a preferred embodiment, nucleic acid sequences encoding proteins, 
polypeptides or peptides are cloned into stable cell line expression vectors. In a preferred 
embodiment, the stable cell line expression vector contains a site specific genomic 
integration site. In another preferred embodiment, the reporter gene construct is cloned into 

15 an episomal mammalian expression vector. 

5.1.5. Transfection 

Once a vector encoding the appropriate gene has been synthesized, a host 
cell is transformed or transfected with the vector of interest. The use of stable transformants 
is preferred. In a preferred embodiment, the host cell is a mammalian cell. In a more 

20 preferred embodiment, the host cell is a human cell. In another embodiment, the host cells 
are primary cells isolated from a tissue or other biological sample of interest. Host cells that 
can be used in the methods of the present invention include, but are not limited to, 
hybridomas, pre-B cells, 293 cells, 293T cells, HeLa cells, HepG2 cells, K562 cells, 3T3 
cells. In another preferred embodiment, the host cells are derived from tissue specific to the 

25 nucleic acid sequence encoding a protein, polypeptide or peptide. In another preferred 

embodiment, the host cells are immortalized cell lines derived firom a source, e,g., a tissue. 
Other host cells that can be used in the present invention include, but are not limited to, 
bacterial cells, yeast cells, virally-infected cells, or plant cells. 

Preferred mammalian host cells include but are not limited to those derived 

30 from humans, monkeys and rodents, (see, for example, Kriegler M. in "Gene Transfer and 

Expression: A Laboratory Manual", New York, Freeman & Co. 1990), such as monkey 

kidney cell line transformed by SV40 (COS-7, ATCC Accession No. CRL 1651); human 

embryonic kidney cell lines (293, 293-EBNA, or 293 cells subcloned for growth in 

suspension culture, Graham et al., J. Gen. Virol., 36:59, 1977; baby hamster kidney cells 
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(BHK, ATCC Accession No. CCL 10); Chinese hamster ovary-cells-DHFR (CHO. Urlaub 
and Chasin. Proc. Natl. Acad. Sci. 77; 4216, 1980); mouse Sertoli cells (Mather, Biol. 
Reprod. 23:243-251, 1980); mouse fibroblast cells (NIH-3T3), monkey kidney cells (CVI 
ATCC Accession No. CCL 70); african green monkey kidney cells (VERO-76, ATCC 
Accession No. CRL-1587); human cervical carcinoma cells (HELA, ATCC Accession No. 
CCL 2); canine kidney cells (MDCK, ATCC Accession No. CCL 34); buffalo rat liver cells 
(BRL 3A, ATCC Accession No. CRL 1442); human lung cells (W138, ATCC Accession 
No. CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor cells 
(MMT 060562, ATCC Accession No. CCL51). 

Other useful eukaryotic host- vector system may include yeast and insect 
systems. In yeast, a number of vectors containing constitutive or inducible promoters may 
be used with Saccharomyces cerevisiae (baker's yeast), Schizosaccharomyces pombe 
(fission yeast), Pichia pastoris, and Hansenula polymorpha (methylotropic yeasts). For a 
review see. Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et al., 
Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et al, 1987, Expression and 
Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 1987, 
Acad. Press, N.Y„ Vol. 153, pp. 516-544; Glover, 1986, DNA Cloning, Vol. U, IRL Press, 
Wash., D.C., Ch, 3; and Bitter, 1987, Heterologous Gene Expression in Yeast, Methods in 
Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, pp. 673-684; and The 
Molecular Biology of the Yeast Saccharomyces^ 1982, Eds. Strathem et al.. Cold Spring 
Harbor Press, Vols. I and II. 

Standard methods of introducing a nucleic acid sequence of interest into host 
cells can be used. Transformation may be by any known method for introducing 
polynucleotides into a host cell, including, for example packaging the polynucleotide in a 
virus and transducing a host cell with the vims, and by direct uptake of the polynucleotide. 
The transformation procedure used depends upon the host to be transformed. Mammalian 
transformations {Le., transfections) by direct uptake may be conducted using the calcium 
phosphate precipitation method of Graham & Van der Eb, 1978, Virol. 52 :546, or the 
various known modifications thereof. Other methods for introducing recombinant 
polynucleotides into cells, particularly into mammalian cells, include dextran-mediated 
transfection, calcium phosphate mediated transfection, polybrene mediated transfection, 
protoplast fusion, electroporation, encapsulation of the polynucleotide(s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. Such methods are well-known to 
one of skill in the art. 
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In a preferred embodiment, stable cell lines containing the constructs of 
interest are generated for high throughput screening. Such stable cells lines may be 
generated by introducing a construct comprising a selectable marker, allowing the cells to 
grow for 1-2 days in an enriched medium, and then growing the cells on a selective 
5 medium. The selectable marker in the recombinant plasmid confers resistance to the 

selection and allows cells to stably integrate the plasmid into their chromosomes and grow 
to form foci which in turn can be cloned and expanded into cell lines. 

A number of selection systems may be used, including but not limited to the 
herpes simplex virus thymidine kinase (Wigler, et al., 1977, Cell 1 1 :223), hypoxanthine- 

10 guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl Acad. Sci. 
USA 48:2026), and adenine phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817) 
genes can be employed in tk-, hgprt- or aprt- cells, respectively. Also, anti-metabolite 
resistance can be used as the basis of selection for dhfir, which confers resistance to 
methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 77:3567; O'Hare, et al., 1981, 

15 Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid 
(Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. Biol. 
150:1); and hygro, which confers resistance to hygromycin (Santerre, et al., 1984, Gene 
30:147) genes. 

20 5.1.6, Cell-Free Extracts 

The invention provides for the translation of a nucleic acid sequence 
encoding a protein, polypeptide or peptide (with or without a premature translation stop 
codon) in a cell-free system. Techniques for practicing the specific aspect of this invention 
will employ, unless otherwise indicated, conventional techniques of molecular biology, 

25 microbiology, and recombinant DNA manipulation and production, which are routinely 
practiced by one of skill in the art. See, e.g., Sambrook, 1989, Molecular Cloning, A 
Laboratory Manual, Second Edition; DNA Cloning, Volumes I and II (Glover, Ed. 1985); 
and Transcription and Translation (Hames & Higgins, Eds. 1984). 

Any technique well-known to one of skill in the art may be used to generate 

30 cell-free extracts for translation. For example, the cell-free extracts can be generated by 

centrifuging cells and clarifying the supernatant. In one embodiment, the cells are 

incubated on ice during the preparation of the cell-free extract. In another embodiment, the 

cells are incubated on ice at least 12 hours, at least 24 hours, at least two days, at least five 

days, at least one week, at least longer than one week. In a more specific embodiment, the 
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cells are incubated on ice at least long enough so as to improve the translation activity of the 
cell extract in comparison to cell extracts that are not incubated on ice. In yet another 
embodiment, the cells are incubated at a temperature between about 0 °C and 10 °C. In a 
preferred embodiment, the cells are incubated at about 4 °C. 

In another preferred embodiment, the cells are centrifiiged at a low speed to 
isolate the cell-free extract for in vitro translation reactions. In a preferred embodiment, the 
cell extract is the supernatant from cells that are centrifiiged at about 2 x g to 20.000 x g. In 
a more preferred embodiment, the cell extract is the supernatant from cells that are 
centrifiiged at about 5 x g to 15,000 x g. In an even more preferred embodiment, the cell 
extract is the supernatant from cells that are centrifiiged at about 10,000 x g. Alternatively, 
in a preferred embodiment, the cell-free extract is about the S 1 to S50 extract. In a more 
preferred embodiment, the cell extract is about the S5 to S25 extract. In an even more 
preferred embodiment, the cell extract is about the S 10 extract. 

The cell-free translation extract may be isolated from cells of any species 
origin. In another embodiment, the cell-free translation extract is isolated from yeast, 
cultured mouse or rat cells, Chinese hamster ovary (CHO) cells, Xenopus oocytes, 
reticulocytes, wheat germ, or rye embryo (see, e.g., Krieg & Melton, 1984, Nature 308:203 
and Dignam etal., 1990 Methods Enzymol. 182:194-203). Alternatively, the cell-free 
translation extract, e.g., rabbit reticulocyte lysates and wheat germ extract, can be purchased 
from, e.g., Promega, (Madison, WI). In another embodiment, the cell-free translation 
extract is prepared as described in International Patent Publication No. WO 01/44516 and 
U.S. Patent No. 4,668,625 to Roberts, the disclosures of which are incorporated by 
reference in their entireties. In a preferred embodiment, the cell-free extract is an extract 
isolated from human cells. In a more preferred embodiment, the human cells are HeLa 
cells. It is preferred that the endogenous expression of the genes with the premature 
translation codons is minimal, and preferably absent, in the cells from which the cell-free 
translation extract is prepared. 

Systems for the in vitro transcription of RNAs with the gene of interest 
cloned in an expression vectors using promoters such as, but not limited to, Sp6, T3, or T7 
promoters (see, e.g., expression vectors from Invitrogen, Carlesbad, CA; Promega, 
Madison, WI; and Stratagene, La JoUa, CA), and the subsequent transcription of the gene 
with the appropriate polymerase are well-known to one of skill in the art (see, e.g., 
Contreras et al. 1982, Nucl. Acids. Res. 10:6353). In another embodiment, the gene 
encoding the premature stop codon can be PCR-amplified with the appropriate primers, 
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with the sequence of a promoter, such as but not limited to, Sp6, T3, or T7 promoters, 
incorporated into the upstream primer, so that the resulting amplified PGR product can be in 
vitro transcribed with the appropriate polymerase. 

Alternatively, a coupled transcription-translation system can be used for the 
expression of a gene encoding a premature stop codon in a cell free extract, such as the 
TnT® Coupled Transcription/Translation System (Promega, Madison, WI) or the system 
described in U.S. Patent No. 5.895.753 to Mierendorf e/ a/., which is incorporated by 
reference in its entirety. 

5.2. Compounds 

Libraries screened using the methods of the present invention can comprise a 
variety of types of compounds. Examples of libraries that can be screened in accordance 
with the methods of the invention include, but are not limited to, peptoids; random 
biooligomers; diversomers such as hydantoins, benzodiazepines and dipeptides; vinylogous 
polypeptides; nonpeptidal peptidomimetics; oligocarbamates; peptidyl phosphonates; 
peptide nucleic acid libraries; antibody libraries; carbohydrate libraries; and small molecule 
libraries (preferably, small organic molecule libraries). In some embodiments, the 
compounds in the libraries screened are nucleic acid or peptide molecules. In a non-limiting 
example, peptide molecules can exist in a phage display Ubrary. In other embodiments, the 
types of compounds include, but are not limited to, peptide analogs including peptides 
comprising non-naturally occurring amino acids, e.g., D-amino acids, phosphorous analogs 
of amino acids, such as ot-amino phosphoric acids and oc-amino phosphoric acids, or amino 
acids having non-peptide linkages, nucleic acid analogs such as phosphorothioates and 
PNAs, hormones, antigens, synthetic or naturally occurring drugs, opiates, dopamine, 
serotonin, catecholamines, thrombin, acetylcholine, prostaglandins, organic molecules, 
pheromones, adenosine, sucrose, glucose, lactose and galactose. Libraries of polypeptides 
or proteins can also be used in the assays of the invention. In some embodiments, the 
compounds are nucleic acid or peptide molecules. In a non-limiting example, peptide 
molecules can exist in a phage display library. 

In a preferred embodiment, the combinatorial libraries are small organic 
molecule libraries, such as, but not limited to, benzodiazepines, isoprenoids, 
thiazolidinones, metathiazanones, pyrrolidines, morpholino compounds, and 
benzodiazepines. In another embodiment, the combinatorial libraries comprise peptoids; 
random bio-oligomers; benzodiazepines; diversomers such as hydantoins, benzodiazepines 
and dipeptides;, vinylogous polypeptides; nonpeptidal peptidomimetics; oligocarbamates; 
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peptidyl phosphonates; peptide nucleic acid libraries; antibody libraries; or carbohydrate 
libraries. Combinatorial libraries are themselves commercially available (see, e.g., 
ComGenex, Princeton, New Jersey; Asinex. Moscow, Ru, Tripos, Inc., St. Louis, Missouri; 
ChemStar, Ltd, Moscow, Russia; 3D Pharmaceuticals, Exton, Pennsylvania; Martek 
5 Biosciences, Columbia, Maryland; etc.). 

In a preferred embodiment, the library is preselected so that the compounds 
of the library are more amenable for cellular uptake. For example, compounds are selected 
based on specific parameters such as, but not limited to, size, lipophilicity, hydrophilicity, 
and hydrogen bonding, which enhance the likelihood of compounds getting into the cells. 
) In another embodiment, the compounds are analyzed by three-dimensional or four- 
dimensional computer computation programs. 

In one embodiment, the combinatorial compound library for the methods of 
the present invention may be synthesized. There is a great interest in synthetic methods 
directed toward the creation of large collections of small organic compounds, or libraries, 
which could be screened for pharmacological, biological or other activity. The synthetic 
methods applied to create vast combinatorial libraries are performed in solution or in the 
solid phase, i.e., on a solid support. Solid-phase synthesis makes it easier to conduct 
multi-step reactions and to drive reactions to completion with high yields because excess 
reagents can be easily added and washed away after each reaction step. Solid-phase 
combinatorial synthesis also tends to improve isolation, purification and screening. 
However, the more traditional solution phase chemistry supports a wider variety of organic 
reactions than solid-phase chemistry. 

Combinatorial compound libraries of the present invention may be 
synthesized using the apparatus described in U.S. Patent No. 6.190,619 to Kilcoin et al., 
which is hereby incorporated by reference in its entirety. U.S. Patent No. 6,190,619 
discloses a synthesis apparatus capable of holding a plurality of reaction vessels for parallel 
synthesis of multiple discrete compounds or for combinatorial libraries of compounds. 

In one embodiment, the combinatorial compound library can be synthesized 
in solution. The method disclosed in U.S. Patent No. 6,194,612 to Boger et al., which is 
hereby incorporated by reference in its entirety, features compounds useful as templates for 
solution phase synthesis of combinatorial libraries. The template is designed to pemiit 
reaction products to be easily purified fi^om unreacted reactants using liquid/liquid or 
solid/liquid extractions. The compounds produced by combinatorial synthesis using the 
template will preferably be small organic molecules. Some compounds in the library may 
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mimic the effects of non-peptides or peptides. In contrast to solid phase synthesize of 
combinatorial compound libraries, liquid phase synthesis does not require the use of 
specialized protocols for monitoring the individual steps of a multistep solid phase synthesis 
(Egner et al., 1995, J.Org. Chem. 60:2652; Anderson et al, 1995, J. Org. Chem. 60:2650; 
Fitch et al, 1994, J. Org. Chem. 59:7955; Look et al, 1994. J. Org. Chem. 49:7588; 
Metzger et al., 1993, Angew. Chem., Int. Ed. Engl. 32:894; Youngquist et al, 1994, Rapid 
Commun. Mass Spect. 8:77; Chu et al, 1995, J. Am. Chem. Soc. 1 17:5419; Brummel et al, 
1994, Science 264:399; Stevanovic et al, 1993, Bioorg. Med. Chem. Lett. 3:431). 

Combinatorial compound libraries useful for the methods of the present 
invention can be synthesized on solid supports. In one embodiment, a split synthesis 
method, a protocol of separating and mixing solid supports during the synthesis, is used to 
synthesize a library of compounds on solid supports (see e.g.. Lam et al, 1997, Chem. Rev. 
97:41-448; Ohlmeyer et al, 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926 and 
references cited therein). Each solid support in the final library has substantially one type of 
compound attached to its surface. Other methods for synthesizing combinatorial libraries 
on solid supports, wherein one product is attached to each support, will be known to those 
of skill in the art {see. e.g., Nefzi et al, 1997. Chem. Rev. 97:449-472). 

As used herein, the term "solid support" is not limited to a specific type of 
soUd support. Rather a large number of supports are available and are known to one skilled 
in the art. Solid supports include silica gels, resins, derivatized plastic films, glass beads, 
cotton, plastic beads, polystyrene beads, alumina gels, and polysaccharides. A suitable solid 
support may be selected on the basis of desired end use and suitability for various synthetic 
protocols. For example, for peptide synthesis, a solid support can be a resin such as p- 
methylbenzhydrylamine (pMBHA) resin (Peptides International, Louisville, KY), 
polystyrenes (e.g., PAM-resin obtained from Bachem Inc., Peninsula Laboratories, etc.), 
including chlon)methylpolystyrene, hydroxymethylpolystyrene and 
aminomethylpolystyrene, poly (dimethylacrylamide)-grafted styrene co-divinyl-benzene 
(e.g., POLYHIPE resin, obtained from Aminotech, Canada), polyamide resin (obtained 
from Peninsula Laboratories), polystyrene resin grafted with polyethylene glycol (e.g., 
TENTAGEL or ARGOGEL, Bayer, Tubingen, Germany) polydimethylacrylamide resin 
(obtained from Milligenmiosearch, California), or Sepharose (Pharmacia, Sweden). 

In some embodiments of the present invention, compounds can be attached 
to solid supports via linkers. Unkers can be integral and part of the solid support, or they 
may be nonintegral that are either synthesized on the solid support or attached thereto after 
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synthesis. Linkers are useful not only for providing points of compound attachment to the 
solid support, but also for allowing different groups of molecules to be cleaved from the 
solid support under different conditions, depending on the nature of the linker. For 
example, linkers can be, inter alia^ electrophilically cleaved, nucleophilically cleaved, 
photocleavable, enzymatically cleaved, cleaved by metals, cleaved under reductive 
conditions or cleaved under oxidative conditions. In a preferred embodiment, the 
compounds are cleaved from the solid support prior to high throughput screening of the 
compounds. 

In certain embodiments of the invention, the compound is a small molecule. 

5.3. Reporter Gene-Based Screening Assays 

Various in vitro assays can be used to identify and verify the ability of a 
compound to modulate premature translation termination and/or nonsense-mediated mRNA 
decay. Multiple in vitro assays can be performed simultaneously or sequentially to assess 
the affect of a compound on premature translation termination and/or nonsense-mediated 
mRNA decay. In a preferred embodiment, the in vitro assays described herein are 
performed in a high throughput format {e.g.^ in microtiter plates). 

5.3.1. Cell-Based Assays 

After a vector containing the reporter gene construct and/or a vector(s) 
containing a nucleic acid sequence comprising a regulatory protein, a component or a 
subunit thereof is transformed or transfected into a host cell and a compound library is 
synthesized or purchased or both, the cells are used to screen the library to identify 
compoimds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay. The reporter gene-based assays may be conducted by contacting a 
compound or a member of a library of compounds with a cell (e.g^., a genetically engineered 
cell) containing a reporter gene construct comprising a reporter gene containing within the 
open reading frame of the reporter gene a premature stop codon or nonsense mutation; and 
measuring the expression and/or activity of the reporter gene. The reporter gene cell-based 
assays may also be conducted by: (a) contacting a compound with a cell containing a first 
nucleic acid sequence and a second nucleic acid sequence, wherein the first nucleic acid 
sequence comprises a regulatory element operably linked to a reporter gene and the second 
nucleic acid sequence comprises a nucleotide sequence with a premature stop codon or 
nonsense mutation that encodes a regulatory protein that binds to the regulatory element of 
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the first nucleic acid sequence and regulates the expression of the reporter gene; and (b) 
measuring the expression and/or activity of the reporter gene. 

The reporter gene cell-based assays may also be conducted by: (a) contacting 
a compound with a cell containing a first nucleic acid sequence, a second nucleic acid 
5 sequence and a third nucleic acid sequence, wherein (i) the first nucleic acid sequence 

comprises a nucleotide sequence encoding a first fiision protein comprising a DNA binding 
domain and a first protein, the nucleotide sequence of the DNA binding domain or the first 
protein containing a premature stop codon or nonsense mutation, (ii) the second nucleic acid 
sequence comprises a nucleotide sequence encoding a second fusion protein comprising an 

10 activation domain and a second protein, the second protein interacting with the first protein 
to produce a regulatory protein, and (iii) the third nucleic acid sequence comprises a 
regulatory element operably linked to a reporter gene, the expression of the reporter gene 
being regulated by the binding of the regulatory protein to the regulatory element: and (b) 
measuring the expression and/or activity of the reporter gene. Further, the reporter gene 

1 5 cell-based assays may also be conducted by: (a) contacting a compound with a cell 

containing a first nucleic acid sequence, a second nucleic acid sequence and a third nucleic 
acid sequence, wherein (i) the first nucleic acid sequence comprises a nucleotide sequence 
encoding a first fusion protein comprising a DNA binding domain and a first protein, (ii) the 
second nucleic acid sequence comprises a nucleotide sequence encoding a second fusion 

20 protein comprising an activation domain and a second protein, the nucleotide sequence of 
the activation domain or the second protein containing a premature stop codon, and the 
second protein interacting with the first protein to produce a premature stop codon or 
nonsense mutation, and (iii) the third nucleic acid sequence comprises a regulatory element 
operably linked to a reporter gene, the expression of the reporter gene being regulated by 

25 the binding of the regulatory protein to the regulatory element: and (b) measiuing the 
expression and/or activity of the reporter gene. 

The alteration in reporter gene expression and/or activity in the reporter gene 
cell-based assays relative to a previously determined reference range, or to the expression or 
activity of the reporter gene in the absence of the compound or the presence of an 

30 appropriate control (e.g., a negative control such as phosphate buffered saline) indicates that 
a particular compound modulates premature translation termination and/or nonsense- 
mediated mRNA decay. In particular, a decrease in reporter gene expression or activity 
relative to a previously determined reference range, or to the expression in the absence of 
the compound or the presence of an appropriate control (e.g., a negative control) may, 
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depending upon the parameters of the reporter gene assay, indicate that a particular 
compound reduces or suppresses premature translation tennination and/or nonsense- 
mediated mRNA decay. In contrast, an increase in reporter gene expression or activity 
relative to a previously determined reference range, or to the expression in the absence of 
5 the compound or the presence of an appropriate control {e.g. , a negative control) may, 
depending upon the parameters of the reporter gene-based assay, indicate that a particular 
compound enhances premature translation termination and/or nonsense-mediated mRNA 
decay. 

The step of contacting a compound or a member of a library of compounds 
D with cell in the reporter gene-based assays described herein is preferably conducted under 
physiologic conditions. In specific embodiment, a compound or a member of a library of 
compounds is added to the cells in the presence of an aqueous solution. In accordance with 
this embodiment, the aqueous solution may comprise a buffer and a combination of salts, 
preferably approximating or mimicking physiologic conditions. Alternatively, the aqueous 
! solution may comprise a buffer, a combmation of salts, and a detergent or a surfactant. 
Examples of salts which may be used in the aqueous solution include, but not limited to. 
KCI, NaCl, and/or MgCb. The optimal concentration of each salt used in the aqueous 
solution is dependent on the cells and compounds used and can be determined using routine 
experimentation. The step of contacting a compound or a member of a library of 
compounds with a cell containing a reporter gene construct and in some circumstances, a 
nucleic acid sequence encoding a regulatory protein, may be performed for at least 0.2 
hours, 0.25 hours, 0.5 hours, 1 hour, 2 hours. 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 
hours, 12 hours, 18 hours, at least 1 day, at least 2 days or at least 3 days. 

The expression of a reporter gene and/or activity of the protein encoded by 
the reporter gene in the cell-based reporter-gene assays may be detected by any technique 
well-known to one of skill in the art. The expression of a reporter gene can be readily 
detected, e.g., by quantifying the protein and/or RNA encoded by said gene. Compounds 
that modulate premature translation termination and/or nonsense-mediated mRNA decay 
may be identified by changes in the gene encoding the premature translation stop codon, 
Le., there is readthrough of the premature translation stop codon and a longer gene product 
is detected. If a gene encoding a naturally-occurring premature translation stop codon is 
used, a longer gene product in the presence of a compound that modulates premature 
translation termination and/or nonsense-mediated mRNA decay can be detected by any 
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method in the art that permits the detection of the longer polypeptide, such as, but not 
limited to, immunological methods. 

Many methods standard in the art can be thus employed, including, but not 
limited to, immunoassays to detect and/or visualize gene expression (e.g.. Western blot, 
immunoprecipitation followed by sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE), immunocytochemistry, radioimmunoassays, ELISA (enzyme 
linked immunosorbent assay), "sandwich" immunoassays, immunoprecipitation assays, 
precipitin reactions, gel diffusion precipitin reactions, immunodiffusion assays, 
agglutination assays, complement-fixation assays, immunoradiometric assays, fluorescent 
immunoassays, protein A immunoassays, or an epitope tag using an antibody that is specific 
to the polypeptide encoded by the gene of interest) and/or hybridization assays to detect 
gene expression by detecting and/or visualizing respectively mRNA encoding a gene ie.g.. 
Northern assays, dot blots, in situ hybridization, etc), etc. Preferably, the antibody is 
specific to the C-terminal portion of the polypeptide used in an immunoassay. Such assays 
are routine and well known in the art (see, e.g., Ausubel et al. eds. 1994, Current Protocols 
in Molecular Biology, Vol. 1. John Wiley & Sons. Inc., New York, which is incorporated 
by reference herein in its entirety). Exemplary immunoassays are described briefly below 
(but are not intended by way of limitation). 

Immunoprecipitation protocols generally comprise lysing a population of 
cells in a lysis buffer such as RIPA buffer (1% NP-40 or Triton X-100. 1% sodium 
deoxycholate. 0.1% SDS, 0.15 M NaCl, 0.01 M sodium phosphate at pH 7.2. 1% Trasylol) 
supplemented with protein phosphatase and/or protease inhibitors (e.g., EDTA. PMSF, 
aprotinin. sodium vanadate), adding the antibody which recognizes the antigen to the cell 
lysate. incubating foraperiod of time(e.g., 1 to 4 hours) at 40° C. adding protein A and/or 
protein G sepharose beads to the cell lysate. incubating for about an hour or more at 40° C, 
washing the beads in lysis buffer and resuspending the beads in SDS/sample buffer. The 
ability of the antibody to immunoprecipitate a particular antigen can be assessed by, e.g., 
western blot analysis. One of skill in the art would be knowledgeable as to the parameters 
that can be modified to increase the binding of the antibody to an antigen and decrease the 
background (e.g., pre-clearing the cell lysate with sepharose beads). For further discussion 
regarding immunoprecipitation protocols see. e.g., Ausubel et al, eds, 1994, Current 
Protocols in Molecular Biology, Vol. 1. John Wiley & Sons. Inc.. New York at 10.16.1. 

Western blot analysis generally comprises preparing protein samples, 
electrophoresis of the protein samples in a polyacrylamide gel {e.g., 8%- 20% SDS-PAGE 
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depending on the molecular weight of the antigen), transferring the protein sample from the 
polyacrylamide gel to a membrane such as nitrocellulose, PVDF or nylon, blocking the 
membrane in blocking solution (e.g., PBS with 3% BSA or non-fat milk), washing the 
membrane in washing buffer (e.g., PBS-Tween 20), blocking the membrane with primary 
antibody (the antibody which recognizes the antigen) diluted in blocking buffer, washing 
the membrane in washing buffer, blocking the membrane with a secondary antibody (which 
recognizes the primary antibody, e.g., an anti-human antibody) conjugated to an enzymatic 
substrate (e.g., horseradish peroxidase or alkaline phosphatase) or radioactive molecule 
(e.g., P or '"l) diluted in blocking buffer, washing the membrane in wash buffer, and 
detecting the presence of the antigen. One of skill in the art would be knowledgeable as to 
the parameters that can be modified to increase the signal detected and to reduce the 
background noise. For fiuther discussion regarding western blot protocols see, e.g., 
Ausubel et al, eds, 1994, Current Protocols in Molecular Biology, Vol. 1, John Wiley & 
Sons, Inc., New York at 10.8.1. 
^ ^ EUS As comprise prq>aring antigen, coating the well of a 96 well microtiter 

plate with the antigen, adding a primary antibody (which recognizes the antigen) conjugated 
to a detectable compound such as an enzymatic substrate (e.g., horseradish peroxidase or 
alkaline phosphatase) to the well and incubating for a period of time, and detecting the 
presence of the antigen. In ELISAs the antibody of interest does not have to be conjugated 
to a detectable compound; instead, a second antibody (which recognizes the primary 
antibody) conjugated to a detectable compound may be added to the well. Further, instead 
of coating the well with the antigen, the antibody may be coated to the well. In this case, a 
second antibody conjugated to a detectable compound may be added following the addition 
of the antigen of interest to the coated well. One of skill in the art would be knowledgeable 
as to the parameters that can be modified to increase the signal detected as well as other 
variations of ELISAs known in the art. For further discussion regarding ELISAs see, e.g., 
Ausubel et al, eds, 1994, Current Protocols in Molecular Biology, Vol. 1, John Wiley & 
Sons, Inc., New York at 1 1.2.1. 

Methods for detecting the activity of a protein encoded by a reporter gene 
will vary with the reporter gene used. Assays for the various reporter genes are well-known 
to one of skill in the art. For example, as described in Section 5. 1 .1., luciferase, beta- 
galactosidase ("beta-gal"), beta-glucoronidase ("GUS"). beta-lactamase, chloramphenicol 
acetyltransferase ("CAT'), and alkaline phosphatase ("AP") are enzymes that can be 
analyzed in the presence of a substrate and could be amenable to high throughput screening. 
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For example, the reaction products of luciferase, beta-galactosidase ("beta-gal"), and 
alkaline phosphatase ("AP") are assayed by changes in light imaging {e.g., luciferase), 
spectrophotometric absorbance {e.g.. beta-gal), or fluorescence {e.g., AP). Assays for 
changes in light output, absorbance, and/or fluorescence are easily adapted for high 
throughput screening. For example, beta-gal activity can be measured with a microplate 
reader. Green fluorescent protein ("GFP") activity can be measured by changes in 
fluorescence. For example, in the case of mutant GFPs that fluoresce at 488 nm, standard 
fluorescence activated cell sorting ("FACS") equipment can be used to separate cells based 
upon GFP activity. 

Changes in mRNA stability of the gene encoding the premature translation 
stop codon can be measured. As discussed above, nonsense-mediated mRNA decay alters 
the stability of an mRNA with a premature translation stop codon so that such mRNA is 
targeted for rapid decay instead of translation. In the presence of a compound that 
modulates premature translation termination and/or nonsense-mediated mRNA decay, the 
stability of the mRNA with the premature translation stop codon is likely altered, 
stabilized. Methods of measuring changes in steady state levels of mRNA are well-known 
to one of skill in the art. Such methods include, but are not limited to. Northern blots, dot 
blots, solution hybridization, RNase protection assays, and SI nuclease protection assays, 
wherein the steady state levels of the mRNA of interest are measured with an appropriately 
labeled nucleic acid probe. Alternatively, methods such as semi-quantitative polymerase 
chain reaction ("PGR") can be used to measure changes in steady state levels of the mRNA 
of interest using the appropriate primers for amplification. 

Alterations in the expression of a reporter gene may be determined by 
comparing the level of expression and/or activity of the reporter gene to a negative control 
{e.g., PBS or another agent that is known to have no effect on the expression of the reporter 
gene) and optionally, a positive control {e.g., an agent that is known to have an effect on the 
expression of the reporter gene, preferably an agent that effects premature translation 
termination and/or nonsense-mediated mRNA decay). Alternatively, alterations in the 
expression and/or activity of a reporter gene may be determined by comparing the level of 
expression and/or activity of the reporter gene to a previously determined reference range. 

5J.2. Cell-Free Extracts 

After a vector containing the reporter gene construct and/or a vector(s) 
containing a nucleic acid sequence comprising a regulatory protein, a component or a 
subunit thereof is produced, a cell-free translation extract is generated or purchased, and a 
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compound library is synthesized or purchased or both, the cell-free translation extract and 
nucleic acid sequences are used to screen the library to identify compounds that modulate 
premature translation termination and/or nonsense-mediated mRNA decay. The reporter 
gene-based assays may be conducted in a cell-free manner by contacting a compound with a 
cell-free extract and a reporter gene construct comprising a reporter gene containing within 
the open reading frame of the reporter gene a premature stop codon or nonsense mutation, 
and measuring the expression and/or activity of said reporter gene. The reporter gene cell- 
free assays may also be conducted by contacting a compound with a cell-free extract and an 
in vitro transcribed RNA of a reporter gene, wherein the RNA product contains a premature 
stop codon or a nonsense mutation and measuring the expression and or activity of the 
protein encoded by the RNA product. Techniques for in vitro transcription are well-known 
to one of skill in the art or described herein (see, e.g. the Example in section 7). The 
reporter gene cell-free assays may also be conducted by: (a) contacting a compound with a 
cell-free extract, a first nucleic acid sequence and a second nucleic acid sequence, wherein 
the first nucleic acid sequence comprises a regulatory element operably linked to a reporter 
gene and the second nucleic acid sequence comprises a nucleotide sequence with a 
premature stop codon or nonsense mutation that encodes a regulatory protein that binds to 
the regulatory element of the first nucleic acid sequence and regulates the expression of the 
reporter gene; and (b) measuring the expression and/or activity of the reporter gene. 

The reporter gene cell-free assays may also be conducted by: (a) contacting a 
compound with a cell-free extract, a first nucleic acid sequence, a second nucleic acid 
sequence and a third nucleic acid sequence, wherein (i) the first nucleic acid sequence 
comprises a nucleotide sequence encoding a first fiision protein comprising a DNA binding 
domain and a first protein, the nucleotide sequence of the DNA binding domain or the first 
protein contaming a premature stop codon or nonsense mutation, (ii) the second nucleic acid 
sequence comprises a nucleotide sequence encoding a second fusion protein comprising an 
activation domain and a second protein, the second protein interacting with the first protein 
to produce a regulatory protein, and (iii) the third nucleic acid sequence comprises a 
regulatory element operably linked to a reporter gene, the expression of the reporter gene 
being regulated by the binding of the regulatory protein to the regulatory element: and (b) 
measuring the expression and/or activity of the reporter gene. Further, the reporter gene 
cell-free assays may also be conducted by: (a) contacting a compound with a cell-free 
extract, a first nucleic acid sequence, a second nucleic acid sequence and a third nucleic acid 
sequence, wherein (i) the first nucleic acid sequence comprises a nucleotide sequence 
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encoding a first fusion protein comprising a DNA binding domain and a first protein, (ii) the 
second nucleic acid sequence comprises a nucleotide sequence encoding a second fusion 
protein comprising an activation domain and a second protein, the nucleotide sequence of 
the activation domain or the second protein containing a premature stop codon or nonsense 
mutation, and the second protein interacting with the first protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory element operably 
hnked to a reporter gene, the expression of the reporter gene being regulated by the binding 
of the regulatory protein to the regulatory element: and (b) measuring the expression and/or 

activity of the reporter gene. 

In the cell-fi-ee reporter gene assays described herein, the alteration in reporter gene 
expression or activity relative to a previously determined reference range, or to the 
expression or activity of the reporter gene in the absence of the compound or the presence 
of an appropriate control {e.g., a negative control) indicates that a particular compound 
modulates premature translation termination and/or nonsense-mediated mRNA decay. In 
particular, a decrease in reporter gene expression or activity relative to a previously 
determined reference range, or to the expression in the absence of the compound or the 
presence of an appropriate control {e.g., a negative control) may. depending upon the 
parameters of the reporter gene assay, indicate that a particular compound reduces or 
suppresses premature translation termination and'or nonsense-mediated mRNA decay. In 
contrast, an increase in reporter gene expression or activity relative to a previously 
determined reference range, or to the expression in the absence of the compound or the 
presence of an appropriate control {e.g., a negative control) may, depending upon the 
parameters of the reporter gene-based assay, indicate that a particular compound enhances 
premature translation termination and/or nonsense-mediated mRNA decay. 

In accordance with the invention, the step of contacting a compound with a cell-fi^ee 
extract and a nucleic acid sequence in the reporter gene-based assays described herein is 
preferably conducted in an aqueous solution comprising a buffer and a combination of salts 
(such as KCl, NaCl and/or MgCb). The optimal concentration of each salt used in the 
aqueous solution is dependent on, e.g., the protein, polypeptide or peptide encoded by the 
nucleic acid sequence {e.g., the regulatory protein) and the compounds used, and can be 
determined using routine experimentation. In a specific embodiment, the aqueous solution 
approximates or mimics physiologic conditions. In another specific embodiment, the 
aqueous solution further comprises a detergent or a sxirfactant. 
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The cell-free reporter gene assays of the present invention can be performed using 
different incubation times. The cell-free extract and the nucleic acid sequence(s) {e.g., a 
reporter gene) can be incubated together before the addition of a compound or a member of 
a library of compounds. In certain embodiments, the cell-free extract are incubated with a 
5 nucleic acid sequence(s) (e.g., a reporter gene) before the addition of a compound or a 
member of a library of compounds for at least 0.2 hours, 0.25 hours, 0.5 hours, 1 hour, 2 
hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 hours, 12 hours, 18 hours, or at least 1 
day. In other embodiments, the cell-free extract, or the nucleic acid sequence(s) (e.g, a 
reporter gene) is incubated with a compound or a member of a library of compounds before 
10 the addition of the nucleic acid sequence(s) (e.g., a reporter gene), or the cell-free extract, 
respectively. In certain embodiments, a compound or a member of a library of compounds 
is incubated with a nucleic acid sequence(s) (e.g., a reporter gene) or cell-free extract before 
the addition of the remaining component, i.e., cell-free extract, or a nucleic acid sequence(s) 
(e.g., a reporter gene), respectively, for at least 0.2 hours, 0.25 hours, 0.5 hours. 1 hour, 2 
15 hours, 3 hours, 4 hours. 5 hours, 6 hours. 8 hours, 10 hours. 12 hours, 18 hours, or at least 1 
day. Once the reaction vessel comprises tiie components, i.e., a compound or a member of a 
library of compounds, the cell-free extract and the nucleic acid sequence(s) (e.g., a reporter 
gene), the reaction may be further incubated for at least 0.2 hours, 0.25 hours, 0.5 hours, 1 
hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 hours, 12 hours, 18 hours, or at 
20 least 1 day. 

The progress of the reaction in the cell-free reporter gene-based assays can be 
measured continuously. Alternatively, time-points may be taken at different times of the 
reaction to monitor the progress of the reaction in the cell-free reporter gene-based assays. 

The activity of a compound in the cell-free extract can be determined by 
25 assaying the activity of a reporter protein encoded by a reporter gene, or alternatively, by 
quantifying the expression of the reporter gene by, for example, labeling the in vitro 
translated protein (e.g., with ^^S-labeled methionine), northern blot analysis, RT-PCR or by 
immunological methods, such as westem blot analysis or immunoprecipitation. Such 
methods are well-known to one of skill in the art. Examples of assays which can be used to 
30 measure the expression and/or activity of a reporter gene are described in Section 5.3. 1 
supra. 

Character ization of the Structure of Comnounds 

If the library comprises arrays or microarrays of compounds, wherein each 
compound has an address or identifier, the compound can be deconvoluted. e.g., by cross- 
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referencing the positive sample to original compound list that was applied to the individual 
test assays. 

If the library is a peptide or nucleic acid library, the sequence of the 
compound can be determined by direct sequencing of the peptide or nucleic acid. Such 
methods are well known to one of skill in the art. 

A number of physico-chemical techniques can be used for the de novo 
characterization of compounds that mediate premattire translation termination and/or 
nonsense-mediated mRNA decay. 

5.4.1. Mass Spectrometry 

The invention provides, in part, for mass spectrometry methods to identify or 
characterize the compounds of the invention. Any mass spectrometric method can be used, 
for example, those employing an ionizer, ion analyzer and detector. 

A number of techniques can be used in order to ionize a sample for 
invetigative or characterization purposes. Such techniques form the charged particles 
required for analysis. Examples of ionization methods include, but are not limited to, 
electron impact, chemical ionization, electrospray ionization, fast atom bombardment and 
matrix assisted laser desorption ionization. The technique used for ionization will depend 
on the type of analyte being examined and the conditions necessary for acquisition. For 
example, electron impact and chemical ionization would be preferred with a relatively small 
volatile sample with a mass of 1 to 1000 daltons; electrospray ionzation would be preferred 
with peptides, proteins and non-volatile samples with a mass of up to 200,000 daltons, fast 
atom bombardment would be preferred with carbohydrates, organometallics. peptides and 
nonvolatile compounds and matrix assisted laser desorption ionization would be preferred 
when examining peptides, proteins and nucleotides. 

A number of ion analysis techniques can be used, in particular those where 
molecular ions and fragment ions are accelerated by manipulation of charged particles 
through the mass spectrometer. Such analyzers include, but are not limited to, quadropole. 
sector (magnetic and/or electrostatic), time of flight (TOF). and ion cyclotron resonance 
(ICR). The technique used for analysis would depend on the sample and the conditions for 
acquisition. For example, one might prefer quadropole when desiring a unit mass 
resolution, fast scan time, and low cost; one might prefer a sector analyzer when desiring 
high resolution and an exact mass; one might prefer time of flight when desiring no 
limitation for m/z maximum and a high tiiroughput; and one might prefer ion cyclotron 
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resonance when desiring very high resolution an exact mass and also to perform ion 
chemistry. 

Any ionizer method can be combined with any ion analyzer technique. 
There are many types of detectors that may be used as part of the mass spectrophotometric 
methods of the invention, in particular those that produce an electronic signal when struck 
by an ion. Calibration would be necessarily performed by introducing a well known 
compound into the instrument and adjusting the circuits so that the compound's molecular 
ion and fragment ions are reported accurately. 

Mass spectrometry (e.g.. electrospray ionization ("ESI") and matrix-assisted 
laser desorption-ionization ("MALDI"), Fourier-transform ion cyclotron resonance ("FT- 
ICR") can be used both for high-throughput screening of compounds that bind to a target 
RNA and elucidating the structure of the compound. 

MALDI uses a pulsed laser for desoiption of the ions and a time-of-flight 
analyzer, and has been used for the detection of noncovalent tRNA:amino-acyl-tRNA 
synthetase complexes (Gruic-Sovulj et al., 1997, J. Biol. Chem. 272:32084-32091). 
However, covalent cross-linking between the target nucleic acid and the compound is 
required for detection, since a non-covalently bound complex may dissociate during the 
MALDI process. 

ESI mass spectrometry ("ESI-MS") has been of greater utility for studying 
non-covalent molecular interactions because, unlike the MALDI process. ESI-MS generates 
molecular ions with little to no fragmentation (Xavier et al., 2000, Trends Biotechnol. 
18(8):349-356). ESI-MS has been used to study the complexes formed by HIV Tat peptide 
and protein with the TAR RNA (Sannes-Lowery et al., 1997, Anal. Chem. 69:5130-5135). 

Fourier-transform ion cyclotron resonance ("FT-ICR") mass spectrometry 
provides high-resolution spectra, isotope-resolved precursor ion selection, and accurate 
mass assignments (Xavier et al., 2000, Trends Biotechnol. 18(8):349-356). FT-ICR has 
been used to study the interaction of aminoglycoside antibiotics with cognate and non- 
cognate RNAs (Hofetadler et at., 1999, Anal. Chem. 71:3436-3440; Griffey et al., 1999. 
Proc. Natl. Acad. Sci. USA 96:10129-10133). As true for all of the mass spectrometry 
methods discussed herein, FT-ICR does not require labeling of the target RNA or a 
compound. 

An advantage of mass spectroscopy is not only the elucidation of the 
structure of a compound, but also the determination of the structure of the compound bound 
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to a target RNA. Such information can enable the discovery of a consensus structure of a 
compound that specifically binds to a target RNA. 

5.4.2. NMR Spectroscopy 
The invention provides, in part, NMR spectroscopic techniques that may be 
used, for example, to characterize and identify small and large molecules of the invention. 
NMR methods are advantageous in understanding characteristics of the compounds of the 
invention because it allows rapid acquisition of single and multi-dimensional structural data 
about a compound in solution. Moreover, the NMR technique is a non-destructive 
technique that also provides dynamic information relating to a compound's behavior in 
complex or in association with other molecules of interest. There are a variety of 
techniques that can be used to examine compounds of the invention using NMR methods. 
In particular, any type of NMR spectrometer can be used, including, but not limited to, 
those of low, medium and high magnetic field. In a preferred embodiment, the NMR 
spectrometer that is used has a high magnetic field, in particular, if the compound has a high 
molecular wei^t, such as, those greater than 1000 daltons. 

Any technique known in the art can be used to acquire data on the 
compounds and also to produce spectra for inteipretation, including, but not limited to, 
those that measure through bond correlations and through space correlations. Both single 
and multi-dimensional spectra can be produced. In another embodiment, the technique that 
isusedishomonuclear. In yet another embodiment, the technique is heteronuclear. In one 
embodiment of the invention, correlation spectroscopy, e.g.. COSY or TOCSY. methods are 
used to measure through bond correlations. In another embodiment of the invention, 
nuclear overhauser effect spectroscopy methods, e.g., NOESY, are used to measure through 
space correlations. In yet another embodiment of the invention, multi-dimensional methods 
are used to identify relationships between heterologous nucleii, e.g., heteronuclear single 
quantum coherence (HSQC) and heteronuclear muUiple quantum coherence (HMQC). 

In another embodiment of the invention, NMR methods are used to 
characterize compounds that are associated with other molecules. For example, complexed 
target nucleic acids can be examined by qualitatively determining changes in chemical shift, 
specifically from distances measured using relaxation effects, and NMR-based approaches 
have been used in the identification of small molecule binders of protein drug targets 
(Xavier et al., 2000, Trends Biotechnol. 18(8):349-356). The determination of structure- 
activity relationships ("SAR") by NMR is the first method for NMR described in which 
small molecules that bind adjacent subsites are identified by two-dimentional 'H-'^N spectra 
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Of the target protein (Shuker et al., 1996. Science 274:1531-1534). The signal from the 
bound molecule is monitored by employing line broadening, transfened NOEs and pulsed 
field gradient diffusion measurements (Moore, 1999, Curr. Opin. Biotechnol. 10:54-58). A 
strategy for lead generation by NMR using a library of small molecules has been recently 
5 described (Fejzo et al., 1 999, Chem. Biol. 6:755-769). 

Other examples of NMR methods that can be used for the invention include, 
but are not limited to, one-dimensional, two-dimensional, three dimension, four dimensional 
NMR methods as well as correlation spectroscopy ("COSY"), and nuclear Overhauser 
effect ("NOE") spectroscopy. Such methods of structure determination of compounds are 
10 well known to one of skill in the art. 

Similar to mass spectroscopy, an advantage of NMR is the not only the 
elucidation of the structure of a compound, but also the determination of the structure of the 
compound bound to the target RNA. Such infoimation can enable the discovery of a 
consensus structure of a compound that specifically binds to a target RNA. 

5.4.3. X-rav CrvstallographY 
X-ray crystallography can be used to elucidate the structure of a compound. 
For a review of x-ray crystallography see, e.g., Blundell et al. 2002, Nat Rev Drug Discov 
l(l):45-54. The first step in x-ray crystallogr^hy is the formation of crystals. The 
formation of crystals begins with the preparation of highly purified and soluble samples. 
20 The conditions for crystallization is then determined by optimizing several solution 

variables known to induce nucleation, such as pH, ionic strength, temperature, and specific 
concentrations of organic additives, salts and detergent. Techniques for automating the 
crystallization process have been developed to automate the production of high-quality 
protein crystals. Once crystals have been formed, the crystals are harvested and prepared 
25 for data collection. The crystals are then analyzed by diffraction (such as multi-circle 
diffractometers, high-speed CCD detectors, and detector off-set). Generally, multiple 
crystals must be screened for structure determinations. 

A number of methods can be used to acquire a dif&action patter so that a 
compound can be characterized. In one embodiment, an X-ray source is provided, for 
example, by a rotating anode generator producing an X-ray beam of a characteristic 
wavelength. There are a number of sources of X-ray radiation Uiat may be used in die 
methods of the invention, including low and high intensity radiation. In one example, the 
timable X-ray radiation is produced by a Synchrotron. In another embodiment, the primary 
X-ray beam is monochromated by either crystal monochromators or focusing mirrors and 
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the beam is passed through a helium flushed collimator. In a preferred embodiment, the 
crystal is mounted on a pin on a goniometer head, that is mounted to a goniometer which 
allows to position the crystal in different orientations in the beam. The diffracted X-rays can 
be recorded using a number of techniques, including, but not limited to image plates, 
multiwire detectors or CCD cameras. In other embodiments, flash cooling, for example, of 
protein crystals, to cryogenic temperatures (-100 K) offers many advantages, the most 
significant of which is the elimination of radiation damage. 

5.4.4. Vibrational SpectrscoPV 
Vibrational spectroscopy {e.g., but not limited to. infrared (IR) spectroscopy 
or Raman spectroscopy) can be used for elucidating the structure of a compound. 

Infrared spectroscopy measures the frequencies of infrared light 
(wavelengths from 100 to 10,000 nm) absorbed by the compound as a resuh of excitation of 
vibrational modes according to quantum mechanical selection rules which require that 
absorption of light cause a change in the electric dipole moment of the molecule. The 
infixed spectrum of any molecule is a unique pattern of absorption wavelengths of varying 
intensity that can be considered as a molecular fingeiprint to identify or characterize any 
compound. 

Infrared spectra can be measured in a scanning mode by measuring the 
absorption of individual frequencies of light, produced by a grating which separates 
frequencies from a mixed-frequency infrared light source, by the compound relative to a 
standard intensity (double-beam instrument) or pre-measured ('blank') intensity 
(single-beam instrument). In a preferred embodiment, infrared spectra are measured in a 
pulsed mode ("FT-IR") where a mixed beam, produced by an interferometer, of all infrared 
Ught frequencies is passed through or reflected off the compound. The resulting 
interferogram. which may or may not be added with the resulting interferograms from 
subsequent pulses to increase the signal strength while averaging random noise in the 
electronic signal, is mathematically transformed into a spectrum using Fourier Transform or 

Fast Fourier Transform algorithms. 

Raman spectroscopy measures the difference in frequency due to absorption 
of infrared frequencies of scattered visible or ultraviolet light relative to the incident beam. 
The incident monochromatic light beam, usually a single laser frequency, is not truly 
absorbed by the compound but interacts with the electric field transiently. Most of the light 
scattered off the sample will be unchanged (Rayleigh scattering) but a portion of the scatter 
light will have frequencies that are the sum or difference of the incident and molecular 
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vibrational frequencies. The selection rules for Raman (inelastic) scattering require a change 
in polarizability of the molecule. While some vibrational transitions are observable in both 
infrared and Raman spectrometry, must are observable only with one or the other technique. 
The Raman spectrum of any molecule is a unique pattern of absorption wavelengths of 
5 varying intensity that can be considered as a molecular fingerprint to identify any 
compound. 

Raman spectra are measured by submitting monochromatic light to the 
sample, either passed through or preferably reflected off, filtering the Rayleigh scattered 
light, and detecting the frequency of the Raman scattered light. An improved Raman 
1 0 spectrometer is described in US Patent No. 5,786,893 to Fink et al., which is hereby 
incorporated by reference. 

Vibrational microscopy can be measured in a spatially resolved fashion to 
address single beads by integration of a visible micrx>scope and spectrometer. A 
microscopic infrared spectrometer is described in U.S. Patent No. 5,581,085 to Reffiier et 
15 al., which is hereby incorporated by reference in its entirety. An instrument that 

simultaneously performs a microscopic infrared and microscopic Raman analysis on a 
sample is described in U.S. Patent No. 5,841,139 to Sostek et al., which is hereby 
incorporated by reference in its entirety. 

In one embodiment of the method, compounds are synthesized on 
polystyrene beads doped with chemically modified styrene monomers such that each 
resulting bead has a characteristic pattern of absorption lines in the vibrational (IR or 
Raman) spectrum, by methods including but not limited to those described by Fenniri et al., 
2000, J. Am. Chem. Soc. 123:8151-8152. Using methods of split-pool synthesis familiar to 
one of skill in the art. the library of compounds is prepared so that the spectroscopic pattern 
of the bead identifies one of the components of the compound on the bead. Beads that have 
been separated according to their ability to bind target RNA can be identified by their 
vibrational spectrum. In one embodiment of the method, appropriate sorting and binning of 
the beads during synthesis then allows identification of one or more further components of 
the compound on any one bead. In another embodiment of the method, partial identification 
of the compound on a bead is possible through use of the spectroscopic pattern of the bead 
with or without the aid of further sorting during synthesis, followed by partial resynthesis of 
the possible compounds aided by doped beads and appropriate sorting during synthesis. 

In another embodiment, the ffi. or Raman spectra of compounds are 
examined while the compound is still on a bead, preferably, or after cleavage from bead, 
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using methods including but not limited to photochemical, acid, or heat treatment. The 
compound can be identified by comparison of the IR or Raman spectral pattern to spectra 
previously acquired for each compound in the combinatorial library. 

5.5. Naturally Occurring Genes with Premature Stop Codons: 
Fnamnles of Disorders and Diseases 

The invention provides for naturally occurring genes with premature stop 
codons to ascertain the effects of compounds on premature translation termination and/or 
nonsense-mediated mRNA decay. In general, the expression of the gene product, in 
particular, a fiiU-length gene product, is indicative of the effect of the compounds on 
premature translation tennination and/or nonsense-mediated mRNA decay. 

In a preferred embodiment, the naturally occurring genes with premature 
stop codons are genes that cause diseases which are due, in part, to the lack of expression of 
the gene resulting from the premature stop codon. Such diseases include, but are not limited 
to, cystic fibrosis, muscular dystrophy, heart disease {e.g., familial hypen^holesterolemia). 
p53-associated cancers (e.g., lung, breast, colon, pancreatic. non-Hodgkin's lymphoma, 
ovarian, and esophageal cancer), colorectal carcinomas, neurofibromatosis, retinoblastoma, 
Wilm's tumor, retinitis pigmentosa, collagen disorders {e.g., osteogenesis imperfecta and 
cirrhosis). Tay Sachs disease, blood disorders {e.g., hemophilia, von WiUebrand disease, b- 
Thalassemia). kidney stones, ataxia-telangiectasia, lysosomal storage diseases, and tuberous 
sclerosis. Genes involved in the etiology of these diseases are discussed below. 

The recognition of translation termination signals is not necessarily limited 
to a simple trinucleotide stop codon, but is instead recognized by the sequences surrounding 
the stop codon in addition to the stop codon itself (see. e.g., Manuvakhova et al, 2000, 
RNA 6(7):1044-1055, which is hereby incorporated by reference in its entirety). Thus, any 
genes containing particular tetranucleotide sequences at the stop codon. such as, but not 
limited to, UGAC. UAGU, UAGC. UAGG, UAGA, UGAA, UGAG. UGAU. UAAC. 
UAAU. UAAG. and UAAA, are candidates of naturally occurring genes with premature 
stop codons that are useful in the present invention. Human disease genes that contain these 
particular sequence motifs are sorted by chromosome is presented as an Example in Section 
8. 

5.5.1. rysric Fibrosis 

Cystic fibrosis is caused by mutations in the cystic fibrosis conductance 

regulator ("CFTR") gene. Such mutations vary between populations and depend on a 

multihide of factors such as. but not limited to, ethnic background and geographic location. 
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Nonsense mutations in the CFTR gene are expected to produce little or not CFTR chloride 
channels. Several nonsense mutations in the CFTR gene have been identified (see, e.g., 
Tzetis et al., 2001, Hum Genet. 109(6):592-60l. Strandvik et al., 2001. Genet Test. 
5(3):235-42; Feldmann et al., 2001, Hum Mutat. 17(4):356; Wilschanski et al., 2000, Am J 
5 Respir Crit Care Med. 161(3 Pt l):860-5; Castaldo et al., 1999, Hum Mutat. 14(3):272; 
Mittre et al., 1999, Hum Mutat. 14(2):182; Mickle et al., 1998, Hum Mol Genet. 
7(4):729-35; Casals etal., 1997, Hum Genet. 101(3):365-70; Mittre et al., 1996. Hum 
Mutat. 8(4):392-3; Bonizzato et al, 1995, Hum Genet. 1995 Apr;95(4):397-402; Greil et 
al, 1995, Wien Klin Wochenschr. 107(15):464-9; Zielenski etal., 1995, Hum Mutat. 
10 5(l):43-7; Dork et al., 1994. Hum Genet. 94(5):533-42; Balassopoulou etal., 1994, Hum 
Mol Genet. 3(10):1887-8; Ghanem et al, 1994, 21(2):434-6; Will et al., J Clin Invest. 1994 
Apr;93(4): 1852-9; Hull et al., 1994. Genomics. 1994 Jan 15;19(2):362-4; Dork et al., 1994. 
Hum Gaiet. 93(l):67-73; Rolfini & Cabrini. 1993, J Clin Invest. 92(6):2683-7; Will et al., 
1993, J Med Genet. 30(10):833-7; Bienvenu et al., 1993, J Med Genet. 30(7):621-2; 
15 Cheadle et al., 1993, Hum Mol Genet. 2(7):1067-8; Casals et al., 1993, Hum Genet. 

91(l):66-70; Reiss et al., 1993, Hum Genet. 91(l):78-9; Chevalier-Porst et al., 1992. Hum 
Mol Genet. l(8):647-8; Hamosh et al., 1992. Hum Mol Genet. l(7):542-4; Gasparini et al., 
1992. J Med Genet. 29(8):558-62; Fanen et al, 1992, Genomics. I3(3):770-6; Jones et al., 
1992. Hum Mol Genet. 1(1):1 1-7; Ronchetto et al, 1992. Genomics. 12(2):417-8.; Macek 
20 et al., 1992, Hum Mutat. l(6):501-2; Shoshani et al, 1992, Am J Hum Genet. 50(l):222-8; 
Schloesser et al., 1991. J Med Genet. 28(12):878-80; Hamosh et al., 1991, J Clin Invest. 
88(6):1880-5; Bal et al., 1991, J Med Genet. 28(10):715-7; Dork et al., 1991, Hum Genet. 
87(4):44i-6; Beaudet et al., 1991. Am J Hum Genet. 48(6):1213; Gasparini et ai, 1991, 
Genomics. 10(1): 193-200; Cutting etal., 1990, N Engl J Med. 1990, 323(24): 1685-9; and 
25 Kerem et al., 1990, Proc Natl Acad Sci USA. 87(21):8447-51. the disclosures of which are 
hereby incorporated by reference in their entireties). Any CFTR gene encoding a premature 
translation codon including, but not limited to, the nonsense mutations described in the 
references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

30 5.5.2. Muscular Dystrophy 

Muscular dystrophy is a genetic disease characterized by severe, progressive 
muscle wasting and weakness. Duchenne muscular dystrophy and Becker muscular 
dystrophy are generally caused by nonsense mutations of the dystrophin gene (see, e.g.. 
Ken- et al., 2001, Hum Genet. 109(4):402-7 and Wagner et al., 2001, Ann Neurol. 
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49(6):706-l 1). Nonsense mutations in other genes have also been implicated in other types 
of muscular dystrophy, such as, but not limited to. collagen genes in Ullrich congenital 
muscular dystrophy (see. e.g., Demir et al, 2002, Am J Hum Genet. 70(6): 1446-58), the 
emerin gene and lamins genes in Emery-Dreifuss muscular dystrophy (see. e.g.. Holt et al., 
2001, Biochem Biophys Res Commun. 287(5):1 129-33; Becane et al.. 2000. Pacing Clin 
Electrophysiol. 23(11 Pt l):1661-6; andBoraieef a/., 2000. Ann Neurol. 48(2): 170-80.). the 
dysferlin gene in Miyoshi myopathy (see, e.g., Nakagawa et al.. 2001. J Neurol Sci. 
l84(l):15-9), the plectin gene in late onset muscular dystrophy (see, e.g., Bauer et al.. 2001, 
Am J Pathol. 158(2):617-25), the delta-sarcoglycan gene in recessive limb-girdle muscular 
dystrophy (see. e.g.. Duggan et al., 1997. Neurogenetics. l(l):49-58), the laminina2-chain 
gene in congenital muscular dystrophy (see, e.g., Mendell et al.. 1998, Hum Mutat. 
12(2):135). the plectin gene in late-onset muscular dystrophy (see, e.g., Rouan et al.. 2000, J 
Invest Dermatol. 114(2):381-7 and Kunzetal.. 2000, J Invest Dermatol. 114(2):376-80), the 
myophosphorylase gene in McArdle's disease (see, e.g., Bruno et al, 1999, Neuiomuscul 
Disord. 9(l):34-7). and the collagen VI m Bethlem myopathy (see. e.g., Lamande et al., 
1998. Hum Mol Genet. 1998 Jun;7(6):981-9). 

Several nonsense mutations in the dystrophin gene have been identified (see, 
e.g., Kerr et al., 2001, Hum Genet. 109(4):402-7; Mendell et al, 2001. Neurology 
57(4):645-50; Fajkusova et al, 2001. Neuromuscul Disord. 1 1(2):133-8; Ginjaar et al, 
2000. Eur J Hum Genet. 8(10):793-6; Lu et al, 2000, J Cell Biol. 148(5):985-96; 
Tuffery-Giraud et al, 1999, Hum Mutat. 14(5):359-68; Fajkusova et al, 1998, J 
Neurogenet. 12(3):183-9; Tuffery et al, 1998, Hum Genet. 102(3):334-42; Shiga et al, 
1997. J Clin Invest. 100(9):2204-10; Winnard et al, 1995, Am J Hum Genet. 56(1): 158-66; 
Prior et al, 1994. Am J Med Genet. 50(l):68-73; Prior et al, 1993, Hum Mol Genet. 
2(3):31 1-3; Prior et al, 1993. Hum Mutat. 2(3): 192-5; Nigro et al.. 1992. Hum Mol Genet. 
l(7):517-20; Worton, 1992. J Inherit Metab Dis. 15(4):539-50; and Bulman et al, 1991, 
Genomics. l0(2):457-60; the disclosures of which are hereby incorporated by reference in 
their entireties). Any gene encoding a premature translation codon implicated in muscular 
dystrophy including, but not limited to, the nonsense mutations described in the references 
cited above, can be used in the present invention to identify compounds that mediate 
premature translation tenmination and/or nonsense-mediated mRNA decay. 

5.5J. Familial Hvpercholest erolemia 
Hypercholesterolemia, or high blood cholesterol, results from either the 

overproduction or the underutilization of low density lipoprotein ("LDL"). 
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Hypercholesterolemia is caused by either the genetic disease familial hypercholesterolemia 
or the consumption of a high cholesterol diet. Nonsense mutations in the LDL receptor 
gene have been implicated in familial hypercholesterolemia. Several nonsense mutations in 
the LDL receptor gene have been identified (see, e.g., Lind et al., 2002, Atherosclerosis 
5 1 63(2):399-407; Salazar et al. , 2002, Hum Mutat. 1 9(4):462-3; Kuhrova et al. , 2002, Hum 
Mutat. 19(1):80; Zakharova et al., 2001, Bioorg Khim. 27(5):393-6; Kuhrova et al., 2001, 
Hum Mutat. 18(3):253; Genschel et al., 2001, Hum Mutat. 17(4):354; Weiss et al., 2000, J 
Inherit Metab Dis. 23(8):778-90; Mozas et al., 2000. Hum Mutat. 15(5):483-4; Shin et al., 
2000, Clin Genet. 57(3):225-9; Graham etal., 1999, Atherosclerosis 147(2):309-16; Hattori 
10 et al., 1999, Hum Mutat. 14(1):87; Cenarro et al., 1998, Hum Mutat. 1 1(5):413; Rodningen 
etal, 1999, Hum Mutat. 13(3):186-96; Hirayama a/.. 1998, J Hum Genet. 43(4):250-4; 
Lind et al., 1998, J Intern Med. 244(1): 19-25; Thiart et al., 1997. Mol Cell Probes 
1 1(6):457-8; Maruyama et al., 1995, Arterioscler Thromb Vase Biol. 15(10):1 713-8; 
Koivisto et al., 1995. Am J Hum Genet. 57(4):789-97; Lombaidi et al., 1995, J Lipid Res. 
15 36(4):860-7; Leren et al., 1993, Hum Genet. 92(1):6-10; Landsberger et al., 1992. Am J 
Hum Genet. 50(2):427-33; Loux et al., 1992, Hum Mutat. 1992;l(4):325-32; Motulsky, 
1989, Arteriosclerosis. 9(1 Suppl):D-7; Lehiman et al., 1987, J Biol Chem. 262(1):401-10; 
and Lehrman et al., 1985, Cell 41(3):735-43; the disclosures of which are hereby 
incorporated by reference in their entireties). Any LDL receptor gene encoding a premature 
20 translation codon including, but not limited to, the nonsense mutations described in the 
references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

5.5.4. p53-associated Cancers 

Mutant forms of the p53 protein, which is thought to act as a negative 
25 regulator of cell proliferation, transformation, and tumorigenesis, have been implicated as a 
common genetic change characteristic of human cancer (see, e.g., Levine et al., 1991, 
Nature 35 1 .453-456 and HoUstein et al., 1991. Science 253:49-53). p53 mutations have 
been implicated in cancers such as, but not limited to, lung cancer, breast cancer, colon 
cancer, pancreatic cancer, non-Hodgkin's lymphoma, ovarian cancer, and esophageal 
30 cancer. 

Nonsense mutations have been identified in the p53 gene and have been 

implicated in cancer. Several nonsense mutations in the p53 gene have been identified (see, 

e.g., Masuda et al., 2000, Tokai J Exp Clin Med. 25(2):69-77; Oh et al., 2000. Mol Cells 

10(3):275-80; Li et al., 2000, Lab Invest. 80(4):493-9; Yang et al., 1999. Zhonghua Zhong 

-67- 



( ( 

wo 2004/001010 PCT/IJS2003/019760 

Liu Za Zhi 21(2):1 14-8; Finkelstein et al, 1998, Mol Diagn. 3(1):37-41; Kajiyama et ai, 
1998, Dis Esophagus. 1 1(4):279-83; Kawamura et at.. 1999, Leak Res. 23(2): 1 1 5-26; Radig 
etal, 1998, Hum Pathol. 29(11): 1310-6; Schuyerera/., 1998, Int J Cancer 76(3):299-303; 
Wang-Gohrke et al, 1998. Oncol Rep. 5(l):65-8; Fulop et ai, 1998, J Reprod Med. 
43(2):1 19-27; Ninomiya et al, 1997, J Dermatol Sci. l4(3):173-8; Hsieh et al, 1996. 
Cancer Lett. 100(1-2):107-13; Rail et al, 1996, Pancreas. 12(l):10-7; Fukutomi etal, 1995, 
Nippon Rinsho. 53(1 1):2764-8; Frebourg et al., 1995, Am J Hum Genet. 56(3):608-15; 
Dove et al, 1995, Cancer Surv. 25:335-55; Adamson et al, 1995, Br J Haematol. 
89(l):61-6; Grayson et al., 1994. Am J Pediatr Hematol Oncol. l6(4):34l-7; Lepelley et al, 
1994, Leukemia. 8(8):1342-9; Mclntyre etal, 1994, J Clin Oncol. 12(5):925-30; Horio et 
al, 1994, Oncogene. 9(4):1231-5; Nakamura et al, 1992, Jpn J Cancer Res. 83(12):1293-8; 
Davidoff et al.. 1992. Oncogene. 7(l):127-33; and Ishioka et al, 1991, Biochem Biophys 
Res Commun. 177(3):901-6; the disclosures of which are hereby incorporated by reference 
in their entireties). Any p53 gene encoding a premature translation codon including, but not 
limited to. the nonsense mutations described in the references cited above, can be used in 
the present invention to identify compounds that mediate premature translation termination 
and/or nonsense-mediated mRNA decay. 

5.5.5. Colorectal Carcinomas 
Molecular genetic abnormalities resulting in colorectal carcinoma involve 
tumor-suppressor genes that undergo inactivation (such as. but not limited to, ape, mcc, dec, 
p53, and possibly genes on chromosomes 8p. Ip. and 22q) and dominant-acting oncogenes 
(such, but not limited to, ras, src, and myc) (see, e.g., Hamilton. 1992, Cancer 70(5 
Suppl): 121 6-21). Nonsense mutations in the adenomatous polyposis coli ("APC") gene and 
mismatch repair genes (such as, but not limited to, mlhl and msh2) have also been 
described. Nonsense mutations have been implicated in colorectal carcinomas (see, e.g., 
Viel et al, 1997, Genes Chromosomes Cancer. 18(1):8-18; Akiyama et al, 1996, Cancer 
78(12):2478-84; Itoh & Imai , 1996. Hokkaido Igaku Zasshi 71(1):9-14; Kolodner et al, 
1994, Genomics. 24(3):516-26; Ohue et al, 1994, Cancer Res. 54(17):4798-804; and Yin et 
al, 1993. Gastroenterology. 104(6): 1633-9; the disclosures of which are hereby 
incorporated by reference in their entireties). Any gene encoding a premature translation 
codon implicated in colorectal carcinoma including, but not limited to, the nonsense 
mutations described in the references cited above, can be used in the present invention to 
identify compoimds that mediate premature translation termination and/or nonsense- 
mediated mRNA decay. 
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5.5.6. Neurofibromatosis 

Neurofibromatosis is an inherited disorder, which is commonly caused 
caused by mutations in the NFl and NF2 tumor suppressor genes. It is characterized by 
multiple intracranial tumors including schwannomas, meningiomas, and ependymomas. 
5 Nonsense mutations in the NFl and NF2 genes have been described. Nonsense mutations 
have been implicated in neurofibromatosis (see, e.g., Lamszus et al, 2001, Int J Cancer 
91(6):803-8; Sestini et al, 2000, Hum Genet. 107(4):366-71; Fukasawa et al, 2000, Jpn J 
Cancer Res. 91(12):1241-9; Park etal, 2000, J Hum Genet. 45(2):84-5; Ueki et al, 1999, 
Cancer Res. 59(23):5995-8; , 1999, Hokkaido Igaku Zasshi. 74(5):377-86; Buske et al, 
10 1999, Am J Med Genet. 86(4):328-30; Harada et al, 1999, Surg Neurol. 5l(5):528-35; 
Krkljus et al, 1998, Hum Mutat. 1 1(5):41 1; Klose et al, 1999. Am J Med Genet. 
83(1);6-12; Park & Pivnick, 1998, J Med Genet. 35(1 0):8 13-20; Bahuau et al, 1998, Am J 
Med Genet. 75(3):265-72; Bijlsma et al, 1997, J Med Genet. 34(1 1):934-6; MacCoUin et 
al, 1996, Ann Neurol. 40(3):440-5; Upadhyaya et al, 1996, Am J Med Genet. 67(4):421-3; 
15 Robinson et al, 1995, Hum Genet. 96(l):95-8.; Legius et al, 1995. J Med Genet. 

32(4):316-9; von Deimling et al, 1995, Brain Pathol. 5(1):1 1-4; Dublin et al, 1995, Hum 
Mutat. 5(l):81-5; Legius etal, 1994, Genes Chromosomes Cancer. 10(4):250-5; Purandare 
et al, 1994, Hum Mol Genet. 3(7):1 109-15; Shen & Upadhyaya, 1993, Hum Genet. 
92(4):410-2; and Estivill etal, 1991, Hum Genet. 88(2):185-8; the disclosures of which are 
20 hereby incorporated by reference in their entireties). Any gene encoding a premature 
translation codon implicated in neurofibromatosis including, but not limited to, the 
nonsense mutations described in the references cited above, can be used in the present 
invention to identify compounds that mediate premature translation termination and/or 
nonsense-mediated mRNA decay. 

2^ 5.5.7. Retinoblastoma 

The retinoblastoma gene plays important roles in the genesis of human 
cancers. Several pieces of evidence have shown that the retinoblastoma protein has dual 
roles in gating cell cycle progression and promoting cellular differentiation (see. e.g., Lee & 
Lee, 1997, Gan To Kagaku Ryoho 24(1 1):1368-80 for a review). Nonsense mutations in 
30 the RB 1 gene have been described. Nonsense mutations have been implicated in 

retinoblastoma (see, e.g.. Klutz et al., 2002, Am J Hum Genet. 71(l):174-9; Alonso et al., 
2001, Hum Mutat. 17(5):412-22; Wong et al., 2000, Cancer Res. 60(21):617I-7; Harbour , 
1998, Ophthalmology 105(8): 1442-7; Fulop et al., 1998, J Reprod Med. 43(2): 1 19-27; 
Onadim et al., 1997, Br J Cancer 76(1 1): 1405-9; Lohmann et al., 1997. Ophthalmologe 
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94(4):263-7; Cowell & Cragg, 1996. Eur J Cancer. 32A(10): 1749-52; Lohmann et al., 1996. 
Am J Hum Genet. 58(5):940-9; Shapiro et al.. 1995, Cancer Res. 55(24):6200-9; Huang et 
al.i 1993, Cancer Res. 53(8): 1889-94; and Cheng & Haas, 1990, Mol Cell Biol. 
10(lO):5502-9; the disclosures of which are hereby incorporated by reference in their 
entireties). Any gene encoding a premature translation codon implicated in retinoblastoma 
including, but not limited to, the nonsense mutations described in the references cited 
above, can be used in the present invention to identify compounds that mediate premature 
translation termination and/or nonsense-mediated mRNA decay. 

5.5.8. Wilm^s Tumor 
Wilm's tumor, or nephroblastoma, is an embryonal malignancy of the kidney 
that affects children. Nonsense mutations in the WTl gene have been implicated in Wihn's 
tumor. Several nonsense mutations in the WTl have been identified (see, e.g., Nakadate et 
al., 1999, Genes Chromosomes Cancer 25(l):26-32; DiUer et al., 1998, J Clin Oncol. 
16(ll):3634-40; Schumacher et al., 1997, Proc Natl Acad Sci USA. 94(8):3972-7; Coppes 
e/a/., 1993.PiocNatl Acad Sci USA. 90(4):1416-9; and Little et al., 1992, Proc Natl 
Acad Sci USA. 89(1 1):4791-5; the disclosures of which are hereby incorporated by 
reference in their entireties). Any WTl gene encoding a premature translation codon 
including, but not limited to, the nonsense mutations described in the references cited 
above, can be used in the present invention to identify compounds that mediate premature 
translation termination and/or nonsense-mediated mRNA decay. 

5.5.9. Retinitis Pigmentosa 
Retinitis pigmentosa is a genetic disease in which affected individuals 
develop progressive degeneration of the rod and cone photoreceptors. Retinitis pigmentosa 
cannot be explained by a single genetic defect but instead the hereditary aberration 
responsible for triggering the onset of the disease is localized in different genes and at 
different sites within these genes (reviewed in. e.g., Kohler et al, 1997, Klin Monatsbl 
Augenheilkd 21 1(2):84-93). Nonsense mutations have been implicated in retinitis 
pigmentosa (see, e.g., Ching et al, 2002, Neurology 58(1 1): 1673-4; Zhang et al, 2002, 
Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 19(3):194-7; Zhang et al, 2002, Hum Mol Genet. 
1;11(9):993-1003; Dietrich et al, 2002, Br J Ophthalmol. 86(3):328-32; Grayson et al, 
2002, J Med Genet. 39(l):62-7; Liu etal, 2001, Zhonghua Yi Xue Za Zhi 81(2):71-2; 
Damji et al, 2001, Can J Ophthalmol. 36(5):252-9; Berson et al, 2001, Invest Ophthalmol 
Vis Sci. 42(10):2217-24; Chan et al, 2001. Br J Ophthalmol. 85(9):1046-8; Baum et al, 
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2001, Hum Mutat. 1 7(5):436; Mashima et al., 2001. Ophthalmic Genet. 22(l):43-7; 
Zwaenepoel et al., 2001, Hum Mutat. 2001;17(1):34-41; Bork et al., 2001, Am J Hum 
Genet. 68(l):26-37; Sharon et al., 2000, Invest Ophthalmol Vis Sci. 41(9):2712-21; Dreyer 
et al., 2000, Eur J Hum Genet. 8(7):500-6; Liu et al., 2000, Hum Mutat. 15(6):584; Wang et 
5 al., 1999, Exp Eye Res. 69(4):; Bowne et al., 1999, Hum Mol Genet. 8(1 1):2121-8; 
Guillonneau et al., 1999, Hum Mol Genet. 8(8):1541-6; Dryja et al., 1999, Invest 
Ophthalmol Vis Sci. 40(8): 1859-65; Sullivan etal., 1999, Nat Genet. 22(3):255-9; Pierce et 
al., 1999, Nat Genet. 22(3):248-54; Janecke et al, 1999, Hum Mutat. 13(2):133-40; Cuevas 
et al., 1998, Mol Cell Probes 12(6):417-20; Schwahn et al., 1998. Nat Genet. 19(4):327-32; 
1 0 Buraczynska et al. , 1 997, Am J Hum Genet. 6 1 (6): 1 287-92; Meindl et al. , 1 996, Nat Genet. 
13(l):35-42; Keen et al., 1996, Hum Mutat. 8(4):297-303; Dryja et al., 1995, Proc Natl 
Acad Sci USA. 92(22): 101 77-81; Apfelstedt-Sylla etal., 1995, Br J Ophthalmol. 
79(l):28-34; Bayes et al., 1995. Hum Mutat. 5(3):228-34; Shastry, 1994, Am J Med Genet. 
52(4):467-74; Gal et al., 1994, Nat Genet- 7(l):64-8; Sargan et aL, 1994. Gene Ther. 1 
15 Suppl 1 :S89; McLaughlin et al., 1993, Nat Genet. 4(2):130-4; Rosenfeld et al., 1992, Nat 
Genet. 1(3):209-13; the disclosures of which are hereby incorporated by reference in their 
entireties). Any gene encoding a premature translation codon implicated in retinitis 
pigmentosa including, but not limited to, the nonsense mutations described in the references 
cited above, can be used in the present invention to identify compounds that mediate 
20 premature translation termination and/or nonsense-mediated mRNA decay. 

5.5.10. Osteogenesis Imperfecta 

Osteogenesis imperfecta is a heterogeneous disorder of type I collagen 
resulting in varying degrees of severity and results from mutations the genes that encode the 
proalpha chains of type I collagen. Nonsense mutations have been implicated in the genes 

25 that encode the proalpha chains of type I collagen ("COLAl " genes) (see, e.g., Slayton et 
al., 2000, Matrix Biol. 19(l):l-9; Bateman et al, 1999, Hum Mutat. 13(4):31 1-7; and 
Willing et al., 1996, Am J Hum Genet. 59(4):799.809; the disclosures of which are hereby 
incorporated by reference in their entireties). Any COLAl gene encoding a premature 
translation codon including, but not limited to, the nonsense mutations described in the 

30 references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

5.5.1 1. Cirrhosis 
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Cirrhosis generally refers to a chronic liver disease that is marked by 
replacement of normal tissue with fibrous tissue. The multidmg resistance 3 gene has been 
implicated in cirrhosis, and nonsense mutations have been identified in this gene (see, e.g., 

Jacquemin et aL, 2001. Gastroenterology. 2001 May; 120(6): 1448-5 8; the 
disclosure of which is hereby incorporated by reference in its entirety). Any gene involved 
in cirrhosis encoding a premature translation codon including, but not limited to, the 
nonsense mutations described in the reference cited above, can be used in the present 
invention to identify compounds that mediate premamre translation termination and/or 
nonsense-mediated mRNA decay. 

5.5.12. Tay Sachs Disease 
Tay Sachs disease is an autosomal recessive disorder affecting the central 
nervous system. The disorder results fiom mutations in the gene encoding the alpha-subunit 
of beta-hexosaminidase A, a lysosomal enzyme composed of alpha and beta polypeptides. 
Several nonsense mutations have been impHcated in Tay Sachs disease (see. e.g., Rajavel & 
Neufeld. 2001, Mol Cell Biol. 21(16):5512-9; Myerowitz, 1997, Hum Mutat. 9(3):195-208; 
Akli et al., 1993, Hum Genet. 90(6):614-20; Mules et al., 1992, Am J Hum Genet. 
50(4):834-41;andAklicfa/., 1991. Genomics. ll(l):124-34; the disclosures of which are 
hereby incorporated by reference in their entireties). Any hexosaminidase gene encoding a 
premature translation codon including, but not limited to. the nonsense mutations described 
in the references cited above, can be used in the present invention to identify compounds 
that mediate premature translation termination and/or nonsense-mediated mRNA decay. 

5.5.13. RInod Disorders 
Hemophilia is caused by a deficiency in blood coagulation factors. Affected 
individuals are at risk for spontaneous bleeding into organs and treatment usually consists of 
administration of clotting factors. Hemophilia A is caused by a deficiency of blood 
coagulation factor VDI and hemophilia B is caused by a deficiency in blood coagulation 
factor IX. Nonsense mutations in the genes encoding coagulation factors have been 
implicated in hemophilia (see, e.g., Dansako et al, 2001, Ann Hematol. 80(5):292-4; 
Moller-Morlang et al., 1999. Hum Mutat. 13(6):504; Kamiya et al, 1998. Rinsho Ketsueki 
39(5):402-4; Freson et al, 1998. Hum Mutat. 1 1(6):470-9; Kamiya et al, 1995, Int J 
Hematol. 62(3):175-81; Walter et al., 1994, Thromb Haemost. 72(l):74-7; Figueiredo, 
1993, Braz J Med Biol Res. 26(9):919-31; Reiner & Thompson, 1992. Hum Genet. 
89(0:88-94; Koeberl et al, 1990. Hum Genet. 84(5):387-90; Driscoll et al, 1989. Blood. 
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74(2):737-42; Chen et ai, 1989, Am J Hum Genet. 44(4):567-9; Mikami et al., 1988, Jinrui 
Idengaku Zasshi. 33(4):409-15; Gitschier e/ a/., 1988, Blood 72(3): 1022-8; and Sommerc/ 
ai, 1987. Mayo Clin Proc. 62(5):387-404; the disclosures of which are hereby incorporated 
by reference in their entireties). Any gene encoding a premature translation codon 
5 implicated in hemophilia including, but not limited to, the nonsense mutations described in 
the references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

Von Willebrand disease is a single-locus disorder resulting from a deficiency 
of von Willebrand factor: a multimeric multifunctional protein involved in platelet adhesion 
10 and platelet-to-platelet cohesion in high shear stress vessels, and in protecting from 
proteolysis and directing circulating factor VIH to the site of injury (reviewed in 
Rodeghiero, 2002, Haemophilia. 8(3):292-300). Nonsense mutations have implicated in 
von WiUehbrand disease (see, e.g., Rodeghiero, 2002. Haemophilia. 8(3):292-300; Enayat 
etal, 2001, Blood 98(3):674-80; Surdhar et al, 2001, Blood 98(l):248-50; Casana et al., 
1 5 2000, Br J Haematol. 1 1 1 (2):552-5; Baronciani et al. , 2000, Thromb Haemost. 

84(4):536-40; Fellowes et al., 2000, Blood 96(2):773-5; Waseem et al., 1999, Thromb 
Haemost. 81(6):900-5; Mohlke et al., 1999, Int J Clin Lab Res. 29(l):l-7; Rieger et al, 
1998, Thromb Haemost. 80(2):332-7; Kenny et al., 1998, Blood 92(l):175-83; Mazurier et 
al., 1998, Ann Genet. 41(l):34-43; Hagiwara et ai, 1996. Thromb Haemost. 76(2):253-7; 
20 Mazurier & Meyer, 1996, Baillieres Clin Haematol. 9(2):229-41; Schneppenheim et ai, 
1994, Hum Genet. 94(6):640-52; Zhang et al., 1994, Genomics 2 1(1): 188-93; Ginsburg & 
Sadler, 1993, Thromb Haemost. 69(2): 177-84; Eikenboom et ai, 1992, Thromb Haemost. 
68(4):448-54; Zhang et al., 1992, Am J Hum Genet. 51(4):850-8; Zhang et ai, 1992, Hum 
Mol Genet. l(l):61-2; and Mancuso et al., 1991, Biochemistry 30(l):253-69; the 
25 disclosures of which are hereby incorporated by reference in their entireties). Any gene 

encoding a premature translation codon implicated in von Willebrand disease including, but 
not limited to, the nonsense mutations described in the references cited above, can be used 
in the present invention to identify compounds that mediate pranature translation 
termination and/or nonsense-mediated mRNA decay. 
30 0 thalassemia is caused by a deficiency in beta globin polypeptides which in 

turn causes a deficiency in hemoglobin production. Nonsense mutations have been 
implicated in b thalassemia (see, e.g., EI-Latif ai, 2002, Hemoglobin 26(l):33-40; 
Sanguansermsri et ai, 2001, Hemoglobin 25(l):19-27; Romao 2000, Blood 
96(8):2895-901; Perea et ai, 1999, Hemoglobin 23(3):231-7; Rhodes etal., 1999, Am J 
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Med Sci. 317(5):341-5; Fonseca et al., 1998, Hemoglobin 22(3): 197-207; Gasperini et al., 
1998, Am J Hematol. 1998 Jan;57(l):43-7; Galanello et al., 1997, Br J Haematol. 
99(2):433-6; Pistidda et al, 1997. Eur J Haematol. 58(5):320-5; Oner et al., 1997, Br J 
Haematol. 96(2):229-34; Yasunaga et al., 1995. Intern Med. 34(12): 1 198-200; MoUna et 
al., 1994, Sangre (Bare) 39(4):253-6; Chang et al., 1994, Int J Hematol. 59(4):267-72; 
Oilman et al., 1994, Am J Hematol. 45(3):265-7; Chan et al., 1993. Prenat Diagn. 
13(10):977-82; Oeorge et al., 1993, Med J Malaysia 48(3):325-9; Divoky et al., 1993. Br J 
Haematol. 83(3):523-4; Fioretti et al., 1993, Hemoglobin 17(1):9-17; Rosatelli et al., 1992. 
AmJHum Genet. 50(2):422-6; Moietal., 1992, Blood 79(2):512-6; Loudianos a/., 1992, 
Hemoglobin 16(6):503-9; Fukumaki, 1991, Rinsho Ketsueki 32(6):587-91; Cao et al., 1991, 
Am J Pediatr Hematol Oncol. 13(2):179-88; Galanello et al., 1990, Clin Genet. 
38(5):327-31 ; Liu, 1990, Zhongguo Yi Xue Ke Xue Yuan Xue Bao 12(2):90-5; 
Aulehla-Scholz et al., 1990, Hum Genet. 84(2):195-7; Cao et al., 1990. Ann N Y Acad Sci. 
612:215-25; Sanguansermsri et al., 1990, Hemoglobin 14(2):157.68; Galanello et al., 1989, 
Blood 74(2):823-7; Rosatelli et al., 1989. Blood 73(2):601-5; Galanello et al., 1989. Prog 
Clin Biol Res. 316B:1 13-21; Galanello et al., 1988. Am J Hematol. 29(2):63-6; Chan et al., 
1988, Blood 72(4): 1420-3; Atweh et al, 1988. J Clin Invest. 82(2):557-61; Masala et al, 
1988, Hemoglobin 12(5-6):661-71; Pirastu et al, 1987, Proc Natl Acad Sci U S A 
84(9):2882-5; Kazazian et al, 1986, Am J Hum Genet. 38(6):860-7; Cao et al, 1986, Prenat 
Diagn. 6(3):159-67; Cao et al, 1985, AnnN Y Acad Sci. 1985;445:380-92; Pirastu et al, 
1984, Science 223(4639):929-30; Pirasm et al, 1983, N Engl J Med. 309(5):284-7; 
Trecartin et al, 1981, J Clin Invest. 68(4):1012-7; and Liebhaber et al, 1981, Trans Assoc 
Am Physicians 94:88-96; the disclosures of which are hereby incorporated by reference in 
their entireties). Any gene encoding a premature translation codon implicated in b 
thalassemia including, but not limited to, the nonsense mutations described in the references 
cited above, can be used in the present invention to identify compounds that mediate 
premature translation termination and/or nonsense-mediated mRNA decay. 

5.5.14. Kidney Stones 
Kidney stones (nephrolithiasis), which affect 12% of males and 5% of 
females in the western world, are familial in 45% of patients and are most commonly 
associated with hypercalciuria (see. e.g., Lloyd et al. Nature 1996 Feb l;379(6564):445-9). 
Mutations of the renal-specific chloride channel gene are associated with hypercalciuric 
nephrolithiasis (kidney stones). Nonsense mutations have been implicated in kidney stones 
(see, e.g., Hoopes et al, 1998, Kidney Int. 54(3):698-705; Lloyd et al, 1997, Hum Mol 
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Genet. 6(8): 1233-9; Lloyd et al, 1996, Nature 379(6564):445-9; and Pras et aL, 1995, Am J 
Hum Genet. 56(6): 1297-303; the disclosures of which are hereby incorporated by reference 
in their entireties). Any gene encoding a premature translation codon implicated in kidney 
stones including, but not limited to, the nonsense mutations described in the references cited 
above, can be used in the present invention to identify compounds that mediate premature 
translation termination and/or nonsense-mediated mRNA decay. 

5.5.15. Ataxia-Telanfiiectasia 

Ataxia-telangiectasia is characterized by increased sensitivity to ionizing 
radiation, increased incidence of cancer, and neurodegeneration and is generally caused by 
mutations in the ataxia-telangiectasia gene (see, e.g.. Barlow et al., 1999, Proc Natl Acad 
Sci USA 96(1 7):991 5-9). Nonsense mutations have been implicated in ataxia-telangiectasia 
(see, e.g., Camacho et al, 2002, Blood 99(l):238-44; Pitts et ah, 2001, Hum Mol Genet. 
10(1 1):1 155-62; Laake et al, 2000, Hum Mutat. 16(3):232.46; Li & Swift, 2000, Am J Med 
Genet 92(3): 170-7; Teraoka etal, 1999, Am J Hum Genet. 64(6): 161 7-31; and 
Stoppa-Lyonnet et al, 1998, Blood 91(10):3920-6; the disclosures of which are hereby 
incorporated by reference in their entireties). Any gene encoding a premature translation 
codon implicated in ataxia-telangiectasia including, but not limited to, the nonsense 
mutations described in the references cited above, can be used in the present invention to 
identify compounds that mediate premature translation termination and/or nonsense- 
mediated mRNA decay. 

5.5.16. Lysosomal Storage Diseases 

There are more than 40 individually recognized lysosomal storage disorders. 
Each disorder results from a deficiency in the activity of a specific enzyme, which impedes 
the lysosome from carrying out its normal degradative role. These include but are not 
limited to the diseases listed subsequently. Aspartylglucosaminuria is caused by a 
deficiency of N-aspartyl-beta-glucosaminidase (Fisher et al., 1990, FEBS Lett. 269:440- 
444); cholesterol ester storage disease (Wolman disease) is caused by mutations in the LBPA 
gene (Fujiyama et al., 1996, Hum. Mutat. 8:377-380); mutations in the CTNS gene are 
associated with cystinosis (Town et al., 1998, Nature Genet. 18:319-324); mutations in a- 
galactosidase A are associated with Fabry disease (Eng et al.,1993, Pediat. Res. 33:128A; 
Sakuraba et al., 1990, Am. J. Hum. Genet. 47:784-789; Davies et al., 1993, Hum. Molec. 
Genet. 2:1051-1053; Miyamura et al., 1996, J. Clin. Invest. 98:1809-1817); fiicosidosis is 
caused by mutations in the FUCAl gene(Kretz et al., 1989. J. Molec. Neurosci. 1:177-180; 
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Yang et al., 1992, Biochem. Biophys. Res. Commun. 189:1063-1068; Seo et al., 1993, 
Hum. Molec. Genet. 2:1205-1208); mucolipidosis type I results from mutations in the 
NEUl gene (Bonten et al..l996, Genes Dev. 10:3156-3169); mucolipidosis type IV results 
from mutations in theMCOLNl gene (Bargal et al.. 2000, Nature Genet. 26:120-123; Sun et 
al., 2000.Hum. Molec. Genet. 9:2471-2478); Mucopolysaccharidosis type I (Hurler 
syndrome) is caused by mutations in the IDUA gene (Scott et al., 1992, Genomics 13:1311- 
1313;Bach et al., 1993, Am. J. Hum. Genet. 53:330-338); Mucopolysaccharidosis type II 
(Hunter syndrome) is caused by mutations in the IDS gene (Sukegawa et al., 1992, 
Biochem. Biophys. Res. Commun. 183:809-813; Bunge et al., 1992 Hum. Molec. Genet. 
1:335-339; Flomen et al., 1992, Genomics 13:543-550); mucopolysaccharidosis type 25IIIB 
(Sanfilippo syndrome type A) is caused by mutations in the SGSH gene(Yogalingam et al., 
2001, Hum. Mutat. 18:264-281); mucopolysaccharidosis type IIIB (Sanfilippo syndrome) is 
caused by mutations in the NAGLU gene (Zhao et al., 1996. Proc. Nat. Acad. Sci. 93:6101- 
6105; Zhao et al., 1995. Am. J. Hum. Genet. 57:A185); mucopolysaccharidosis type HID is 
15 caused by mutations in the glucosamine-6-sulfatase (G6S) gene (Robertson et al., 1 988, 

Hum. Genet. 79:175-178); mucopolysaccharidosis type IVA (Morquio syndrome) is caused 
by mutations in the GALNS gene (Tomatsu et al., 1995, Am. J. Hum. Genet. 57:556-563; 
Tomatsu et al.,1995. Hum. Mutat. 6:195-196); mucopolysaccharidosis type VI (Maroteaux- 
Lamysyndrome) is caused by mutations in the ARSB gene (Litjens et al., 1992. Hum. 
20 Mutat. 1 :397-402; Isbrandt et al.. 1996, Hum. Mutat. 7:361-363); mucopolysaccharidosis 
type vn (Sly syndrome) is caused by mutations in the beta-glucuronidase (GUSB) 
gene(Yamada et al.. 1995, Hum. Molec. Genet. 4:651-655); mutations in CLNl (PPTl) 
cause infantile neuronal ceroid lipofuscinosis (Das et al., 1998 J. Clin. Invest. 102:361-370; 
Mitchison et al.. 1998, Hum. Molec. Genet. 7:291-297); late infantile type ceroid 
25 lipofuscinosis is caused by mutations in the CLN2 gene (Sleat et al., 1997, Science 
277: 1802-1805); juvenile neuronal ceroid lipofuscinosis (Batten disease) is caused 
bymutations in the CLN3 gene (Mole et al., 1999. Hum. Mutat. 14: 199-215); late 
infantileneuronal ceroid lipofuscinosis, Finnish variant, is caused by mutations in the CLN5 
gene(Savukoski et al., 1998. Nature Genet. 19:286-288); late-infantile form of neuronal 
30 ceroid lipofuscinosis is caused by mutations in the CLN6 gene (Gao et al., 2002, Am. 

J.Hum. Genet. 70:324-335); Niemann-Pick disease is caused by mutations in the ASM gene 
(Takahashi et al., 1992, J. Biol. Chem. 267:12552-12558; types A and B) and theNPCl 
gene (Millat et al., 2001, Am. J. Hum. Genet. 68:1373-1385; type C); Kanzaki disease is 
caused by mutations in the NAGA gene (Keulemans et et al.. 1996. J. Med.Genet. 33:458- 
35 464); Gaucher disease is caused by mutations in the GB A gene (Stone, et al., 1 999, Europ. 
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J. Hum. Genet. 7:505-509); Glycogen storage disease II is the prototypic lysosomal storage 
disease and is caused by mutations in the GAA gene(Becker et al., 1998, Am. J. Hum. 
Genet. 62:991-994); Krabbe disease is caused by mutations in the GALC gene (Sakai et a!., 
1994, Biochem. Biophys. Res. Commun. 198:485-491); Tay-Sachs disease is caused by 
mutations in the HEXA gene (Akli etal., 1991, Genomics 11:124-134; Mules etaL, 1992, 
Am. J. Hum. Genet. 50: 834-84 l;Triggs-Raine et al., 1991, Am. J. Hum. Genet. 49:1041- 
1054; Drucker et al., 1993, Hum. Mutat. 2:415-417; Shore et al., 1992, Hum. Mutat. 1:486- 
490); mutations in the GM2Agene causes Tay-Sachs variant AB (Schepers et al., 1996, Am. 
J. Hum. Genet. 59:1048-1056; Chen et al., 1999, Am. J. Hum. Genet. 65:77-87); mutations 
in the HEXB gene cause Sandhoff disease (Zhang et al., 1994, Hum Mol Genet 3:139-145); 
alphamannosidosis type II is caused by mutations in the MAN2B1 gene (Gotoda et al., 
1998, Am. J. Hum. Genet. 63:1015-1024; Autio et al., 1973, Acta Paediat. Scand. 62:555- 
565); metachromatic leukodystrophy is caused by mutations in the ARSA gene(Gieselmann 
et al., 1994, Hum. Mutat. 4:233-242). Any gene containing a premature translation codon 
implicated in lysosomal storage disease disorders including, but not limited to, the nonsense 
mutations and genes described in the references cited above, can be used in the present 
invention to identify compounds that mediate premature translation termination and/or 
nonsense-mediated mRNA decay. 

5.5.17. Tuberous Sclerosis 

Tuberous sclerosis complex (TSC) is a dominantly inherited disease 
characterized by the presence of hamartomata in multiple organ systems. The disease is 
caused bymutations in TSCl (van Slegtenhorst et al, 1997 Science 277:805-808; Sato et 
ai, 2002, J. Hum. Genet. 47:20-28) and/or TSC2 (Vrtel et al, 1996, J. Med. Genet. 33:47- 
51; Wilson et al, 1996, Hum. Molec. Genet. 5:249-256; Au et aL, 1998, Am. J.Hum. Genet. 
62:286-294; Verhoef et aL, 1999, Europ. J. Pediat. 158:284.287;CarsiIlo et aL, 2000, Proc. 
Nat. Acad. Sci. 97:6085-6090). Any gene containing a premature translation codon 
implicated in tuberous sclerosis including, but not limited to, the nonsense mutations 
described in the references cited above, can be used in the present invention to identify 
compounds that mediate premature translation termination and/or nonsense-mediated 
mRNA decay. 

5.6. Secondary Biological Screens or Assays 
5.6.1. In vitro Assays 
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The compounds identified in the assays described supra (for convenience 
referred to herein as a "lead" compound) can be tested for biological activity using host 
cells containing or engineered to contain a gene of interest with a premature stop codon or 
nonsense mutation coupled to a functional readout system. For example, a phenotypic or 
physiological readout can be used to assess the premature translation termination and/or 
nonsense-mediated mRNA decay of the RNA product encoded by the gene of interest in the 
presence and absence of the lead compound. 

In one embodiment, a phenotypic or physiological readout can be used to 
assess the premature translation termination and/or nonsense-mediated mRNA decay of an 
RNA product of interest in the presence and absence of the lead compound. In accordance 
with this embodiment, cell-based and cell-firee assays described herein, or in International 
Publication No. WO 01/44516 (which is incorporated herein by reference in its entirety) 
may be used to assess the premature translation termination and/or nonsense-mediated 
mRNA decay of the RNA product of interest. Where the gene product of interest is 
involved in cell growth or viability, the in vivo effect of the lead compound can be assayed 
by measuring the cell growth or viability of the target cell. Such assays can be carried out 
with representative cells of cell types involved in a particular disease or disorder (e.g., 
leukocytes such as T cells, B cells, natural killer cells, macrophages, neutrophils and 
eosinophils). A lower level of proliferation or survival of the contacted cells indicates that 
the lead compound is effective to treat a condition in the patient characterized by 
uncontrolled cell growth. Alternatively, instead of culturing cells fi-om a patient, a lead 
compound may be screened using cells of a tumor or malignant cell line or an endothelial 
cell line. Specific examples of cell culture models include, but are not limited to, for lung 
cancer, primary rat lung tumor ceUs (see, e.g., Swafford et al.. 1997, Mol. Cell. Biol., 
17:1366-1374) and large-cell undifferentiated cancer cell lines (see, e.g., Mabry et al., 1991, 
Cancer Cells, 3:53-58); colorectal cell lines for colon cancer (see, e.g.. Park & Gazdar, 
1996, J. Cell Biochem. Suppl. 24:131-141); multiple established cell lines for breast cancer 
(see, e.g., Hambly et al., 1997, Breast Cancer Res. Treat. 43:247-258; Gierthy et al., 1997, 
Chemosphere 34:1495-1505; and Prasad & Church, 1997, Biochem. Biophys. Res. 
Commun. 232:14-19); a number of well-characterized cell models for prostate cancer (see, 
e.g., Webber et al., 1996, Prostate, Part 1, 29:386-394; Part 2, 30:58-64; and Part 3. 30:136- 
142 and Boulikas, 1997, Anticancer Res. 17:1471-1505); for genitourinary cancers, 
continuous human bladder cancer cell lines (see, e.g., Ribeiro et al., 1997, Int. J. Radiat. 
Biol. 72:1 1-20); organ cultures of transitional cell carcinomas (see, e.g., Booth et al., 1997, 
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Lab Invest. 76:843-857) and rat progression models (see, e.g.. Vet et al., 1997, Biochim. 
Biophys Acta 1360:39-44); and established cell lines for leukemias and lymphomas (see, 
e.g., Drexler, 1994, Leuk, Res. 18:919-927 and Tohyama, 1997, Int. J. Hematol. 65:309- 
317). 

Many assays well-known in the art can be used to assess the survival and/or 
growth of a patient cell or cell line following exposure to a lead compound; for example, 
cell proliferation can be assayed by measuring bromodeoxyuridine (BrdU) incorporation 
(see, e.g., Hoshino et al., 1986, Int. J. Cancer 38:369 and Campana et al., 1988, J. Immunol. 
Meth. 107:79) or (3H)-thymidine incorporation (see, e.g., Chen, 1996, Oncogene 13:1395- 
403 and Jeoung, 1995, J. Biol. Chem. 270:18367-73), by direct cell count, by detecting 
changes in transcription, translation or activity of known genes such as proto-oncogenes 
(e.g., fos, myc) or cell cycle markers (Rb, cdc2, cyclin A, Dl, D2, D3, E, etc.). The levels 
of such protein and mRNA and activity can be detennined by any method well known in the 
art. For example, protein can be quantitated by known immunodiagnostic methods such as 
western blotting or immunoprecipitation using commercially available antibodies. mRNA 
can be quantitated using methods that are well known and routine in the art, for example, 
using northern analysis, RNase protection, the polymerase chain reaction in connection with 
reverse transcription C*RT-PCR"). Cell viability can be assessed by using trypan-blue 
staining or other cell death or viability markers known in the art. In a specific embodiment, 
the level of cellular ATP is measured to detemiined cell viability. Differentiation can be 
assessed, for example, visually based on changes in morphology. 

The lead compound can also be assessed for its ability to inhibit cell 
transformation (or progression to malignant phenotype) in vitro. In this embodiment, cells 
with a transformed cell phenotype are contacted with a lead compound, and examined for 
change in characteristics associated with a transformed phenotype (a set of in vitro 
characteristics associated with a tumorigenic ability in vivo), for example, but not limited 
to, colony formation in soft agar, a more rounded cell morphology, looser substratum 
attachment, loss of contact inhibition, loss of anchorage dependence, release of proteases 
such as plasminogen activator, increased sugar transport, decreased serum requirement, or 
expression of fetal antigens, etc. (see, e.g., Luria et al., 1978, General Virology, 3d Ed., 
John Wiley & Sons, New York, pp. 436-446). 

Loss of invasiveness or decreased adhesion can also be assessed to 
demonstrate the anti-cancer effects of a lead compound. For example, an aspect of the 
foraiation of a metastatic cancer is the ability of a precancerous or cancerous cell to detach 

-79- 



(' ( 

WO20«4/0«1010 PCT/US2003/019760 
from primary site of disease and establish a novel colony of growth at a secondary site. The 
ability of a cell to invade peripheral sites reHects its potential for a cancerous state. Loss of 
invasiveness can be measured by a variety of techniques known in the art including, for 
example, induction of E-cadherin-mediated cell-cell adhesion. Such E-cadherin-mediated 
adhesion can result in phenotypic reversion and loss of invasiveness (see. e.g., Hordijk et 

al.. 1997, Science 278:1464-66). 

Loss of invasiveness can further be examined by inhibition of cell migration. 
A variety of 2-dimensional and 3-dimensional cellular matrices are commercially available 
(Calbiochem-Novabiochem Corp. San Diego, CA). Cell migration across or into a matrix 
can be examined using microscopy, time-lapsed photography or videography, or by any 
method in the art allowing measurement of cellular migration. In a related embodiment, 
loss of invasiveness is examined by response to hepatocyte growth factor ("HGF"). HGF- 
induced cell scattering is correlated with invasiveness of ceUs such as Madin-Darby canine 
kidney ("MDCK") cells. This assay identifies a cell population that has lost cell scattering 
activity in response to HGF (see. e.g., Hordijk et al.. 1997. Science 278:1464-66). 

Alternatively, loss of invasiveness can be measured by cell migration 
through a chemotaxis chamber (Neuroprobe/ Precision Biochemicals Inc. Vancouver, BC). 
In such assay, a chemo-attractant agent is incubated on one side of the chamber (e.g.. the 
bottom chamber) and cells are plated on a filter separating the opposite side {e.g., the top 
chamber). In order for cells to pass from the top chamber to the bottom chamber, the cells 
must actively migrate through small pores in the filter. Checkerboard analysis of the 
number of cells that have migrated can then be correlated with invasiveness (see e.g., 
Ohnishi, 1993, Biochem. Biophys. Res. Comraun.l93:5 18-25). 

A lead compound can also be assessed for its ability to alter the expression of 
a secondary protein (as determined. e.g. by western blot analysis) or RNA, whose 
expression and/or activation is regulated directly or indirectly by the gene product of a gene 
of interest containing a prematiire stop codon or a nonsense mutation (as determined, e.g., 
by RT-PCR or northern blot analysis) in cultured cells in vitro using methods which are 
well known in the art. Further, chemical footprinting analysis can be conducted as 
described herein (see, e.g.. Example 7) or also well-known in die art. 



5.6.2. Animal Models 

Animal model systems can be used to demonsti^te the safety and efficacy of 
the lead compounds identified in the nonsense suppression assays described above. The 
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lead compounds identified in the nonsense suppression assay can then be tested for 
biological activity using animal models for a disease, condition, or syndrome of interest. 
These include animals engineered to contain the target RNA element coupled to a fimctional 
readout system, such as a transgenic mouse. 

There are a number of methods that can be used to conduct animal model 
studies. Briefly, a compound identified in accordance with the methods of the invention is 
introduced into an animal model so that the effect of the compound on the manifestation of 
disease can be determined. The prevention or reduction in the severity, duration or onset of 
a symptom associated with the disease or disorder of the animal model that is associated 
with, characterized by or caused by premature translation termination and/or nonsense 
mediated mRNA decay would indicate that the compound adminstered to the animal model 
had a prophylactic or therapeutic effect. Any method can be used to introduce the 
compound into the animal model, including, but not limited to, injection, intravenous 
infusion, oral ingestion, or inhalation. In a preferred embodiment, transgenic hosts are 
constructed so that the animal genome encodes a gene of interest with a premature 
translation termination sequence or stop codon. In such an embodiment, the gene, containg 
a premature translation termination sequence or stop codon, would not encode a fiill length 
peptide from a transcribed mRNA. The adminsitration of a compound to the animal model, 
and the expression of a fiill length protein, polypeptide or peptide, for example, 
corresponding to the gene containing a premature stop codon would indicate that the 
compound modulates premature translation termination. Any method known in the art, or 
described herein, can be used to determine if the stop codon was modulated by the 
compound. In another embodiment, an animal is transfected with a reporter construct 
comprising a regulatory element operably linked to a reporter gene so that the expression 
the reporter gene is regulated by a regulatory protein or subunit thereof encoded by a 
nucleic aicd sequence that contains a premature translation termination sequence or stop 
codon suppression. In such an embodiment, the animal can be cotransfected with a 
recombinant vector comprising the nucleic acid sequence encoding the regulatory protein 
with a premature stop codon. In another embodiment, the animal host genome encodes a 
native gene containing a premature stop codon. In yet another embodiment of the 
mvention, the animal host is a natural mutant, i.e., natively encoding a gene with a 
premature stop codon. For example, the animal can be a model for cystic fibrosis wherein 
the animal genome contains a natural mutation that incorporates a premature stop codon or 
translation termination sequence. 
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Examples of animal models for cystic fibrosis include, but are not limited to, 
cftr(-/-) mice (see, e.g., Freedman et al., 2001, Gastroenterology 121(4):950-7), 
cftr(tmlHGU/tmlHGU) mice (see, e.g., Bemhard et ai, 2001, Exp Lung Res 27(4):349- 
66), CFTR-deficient mice with defective cAMP-mediated Cl(-) conductance (see, e.g., 
Stotland e/ a/., 2000, Pediatr Pulmonol 30(5):4 13-24), C57BL/6- 
Cftr(mlUNC)/Cftr(mlUNC) knockout mice (see, e.g., Stotland et al., 2000, Pediatr 
Pulmonol 30(5):413-24), an animal model of the human airway, using bronchial xenografts 
engrafted on rat tracheas and implanted into nude mice (see, e.g., Engelhardt et al., 1992, J. 
Clin. Invest. 90: 2598-2607), a transgenic mouse model of cystic fibrosis (see, e.g., Clarke 
et al., 1992, Science 257: 1 125-1 128; Colledge et al., 1992. Lancet 340: 680 only; Dorin et 
al, 1992. Nature 359: 211-215; Snouwaert et al., 1992, Science 257: 1083-1088; Manson et 
al.. 1997. EMBO J. 16: 4238-4249). 

Examples of animal models for muscular dystrophy include, but are not 
limited to, mouse, hamster, cat, dog, and C. elegans. Examples of mouse models for 
muscular dystrophy include, but are not limited to. the dy-/- mouse (see, e.g., Connolly et 
al., 2002, J Neurounmunol 127(l-2):80-7), a muscular dystrophy with myositis (mdm) 
mouse mutation (see, e.g., Garvey et al., 2002. Genomics 79(2):146-9). the mdx mouse 
(see. e.g, Nakamura et al., 2001, Neuromuscul Disord 1 1(3):251-9). the utrophin- 
dystrophin knockout (dko) mouse (see, e.g., Nakamura et al, 2001, Neuromuscul Disord 
1 1(3):251-9), the dy/dy mouse (see, e.g, Dubowitz et al, 2000, Neuromuscul Disord 10(4- 
5):292-8), the mdx(Cv3) mouse model (see. e.g.. Fillers et al, 1999, Laryngoscope 
109(8):1310-2), and the myotonic ADR-MDX mutant mice (see, e.g., Kramer et al, 1998, 
Neuromuscul Disord 8(8):542-50). Examples of hamster models for muscular dystrophy 
include, but are not limited to. sarcoglycan-deficient hamsters (see. e.g., Nakamura et al, 
2001. Am J Physiol Cell Physiol 281(2):C690-9) and the BIO 14.6 dystrophic hamster (see, 
e.g., Schlenker & Burbach. 1991, J Appl Physiol 7 1(5): 1655-62). An example of a fehne 
model for muscular dystrophy includes, but is not limited to, the hypertrophic feline 
muscular dystrophy model (see. e.g., Gaschen & Burgunder, 2001. Acta Neuropathol (Berl) 
l01(6):591-600). Canine models for muscular dystrophy include, but are not limited to, 
golden retriever muscular dystrophy (see, e.g., Fletcher et al, 2001, Neuromuscul Disord 
1 1(3):239-43) and canine X-linked muscular dystrophy (see, e.g., Valentine et al. 1992, 
Am J Med Genet 42(3):352-6). Examples of C. elegans models for muscular dystrophy are 
described in Chamberlain & Benian. 2000. Curr Biol 10(21):R795-7 and Culette & Sattelle, 
2000. Hum Mol Genet 9(6):869-77. Also, a mouse model for Duchenne type muscular 
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dystrophy has been used to show that treatment with anabolic steroids increases myofiber 
damage (see, e.g., Krahn et al., 1994, J. Neurol. Sci. 125: 138-146). A feline model for 
Duchenne type muscular dystrophy has also been described (see, e.g., Winand et al., 1994, 
4: 433-445). 

Examples of animal models for familial hypercholesterolemia include, but 
are not limited to, mice lacking functional LDL receptor genes (see, e.g., Aji et al., 1997, 
Circulation 95(2):430-7), Yoshida rats (see, e.g., Fantappie et al., 1992, Life Sci 
50(24): 191 3-24), the JCR:LA-cp rat (see, e.g., Richardson et al., 1998, Atherosclerosis 
138(1): 135-46), swine (see, e.g., Hasler-Rapacz et al., 1998, Am J Med Genet 76(5):379- 
86), the Watanabe heritable hyperiipidaemic rabbit (see, e.g., Tsutsumi etal., 2000, 
Arzneimittelforschung 50(2): 1 18-21; Harsch et al., 1998, Br J Phannacol 124(2):227-82; 
and Tanaka et al., 1995, Atherosclerosis 1 14(l):73-82); and a family of rhesus monkeys 
with hypercholesterolemia due to deficiency of the LDL receptor (see, e.g., Scanu et al., 
1988, J. Lipid Res. 29: 1671-1681). 

An example of an animal model for human cancer in general includes, but is 
not limited to, spontaneously occurring tumors of companion animals (see, e.g.. Vail & 
MacEwen, 2000, Cancer Invest 18(8):781-92). Examples of animal models for lung cancer 
include, but are not limited to, lung cancer animal models described by Zhang & Roth 
(1994, In Vivo 8(5):755-69) and a transgenic mouse model with disrupted p53 function 
(see, e.g. Moms et al., 1998, J La State Med Soc 1 50(4): 1 79-85). An example of an animal 
model for breast cancer includes, but is not limited to, a transgenic mouse that 
overexpresses cyclin Dl (see, e.g., Hosokawa et al., 2001, Transgenic Res 10(5):471-8). 
An example of an animal model for colon cancer includes, but is not limited to, a TCRbeta 
and p53 double knockout mouse (see, e.g., Kado et al., 2001, Cancer Res 61(6):2395-8). 
Examples of animal models for pancreatic cancer include, but are not limited to, a 
metastatic model of Panc02 murine pancreatic adenocarcinoma (see, e.g., Wang et al., 2001, 
Int J Pancreatol 29(l):37-46) and nu-nu mice generated in subcutaneous pancreatic tumours 
(sec, e.g, Ghaneh et al., 2001, Gene Ther 8(3): 199-208). Examples of animal models for 
non-Hodgkin's lymphoma include, but are not limited to, a severe combined 
immunodeficiency ("SCID") mouse (see, e.g., Bryant et al., 2000, Lab Invest 80(4):553-73) 
and an IgHmu-HOXl 1 transgenic mouse (see, e.g.. Hough et al., 1998, Proc Natl Acad Sci 
USA 95(23): 13853-8). An example of an animal model for esophageal cancer includes, but 
is not limited to, a mouse transgenic for the human papillomavirus type 16 E7 oncogene 
(see, e.g., Berber et al., 1996, J Virol 70(3): 1873-81). Examples of animal models for 
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colorectal carcinomas include, but are not limited to. Ape mouse models (see. e.g., Fodde & 
Smits. 2001. Trends Mol Med 7(8):369-73 and Kuraguchi et al., 2000. Oncogene 
19(50):5755-63). An example of an animal model for neurofibromatosis includes, but is not 
limited to. mutant NFl mice (see. e.g., Cichowski et al., 1996. Semin Cancer Biol 7(5):291- 
5 8). Examples of animal models for retinoblastoma include, but are not limited to. 

transgenic mice that expression the simian virus 40 T antigen in the retina (see. e.g., Howes 
et al., 1994, Invest Ophthalmol Vis Sci 35(2):342-51 and Windle et al, 1990, Nature 
343(6259):665-9) and inbred rats (see, e.g. , Nishida et al., 1981, Cuit Eye Res 1(0:53-5 
and Kobayashi et al., 1982. Acta Neuropathol (Berl) 57(2-3):203-8). Examples of animal 
0 models for Wilm's tumor include, but are not limited to, a WTl knockout mice (see, e.g., 
Schamhorst et al., 1997. Cell Growth Differ 8(2): 1 33-43). a rat subline with a high 
incidence of neuphroblastoma (see, e.g., Mesfin & Breech, 1996. Lab Anim Sci 46(3):321. 
6). and a Wistar/Furth rat with Wilms' tumor (see. e.g.. Murphy et al.. 1987. Anticancer Res 
7(4B):717-9). 

5 Examples of animal models for retinitis pigmentosa include, but are not 

limited to, the Royal College of Surgeons ("RCS*') rat (see. e.g., VoUrath et al., 2001. Proc 
Natl Acad Sci USA 98(22); 12584-9 and Hanitzsch et al., 1998. Acta Anat (Basel) 162(2- 
3):1 19-26). a rhodopsin knockout mouse (see, e.g., Jaissle et al., 2001. Invest Ophthalmol 
Vis Sci 42(2):506-13), Wag/Rij rats (see. e.g., Lai et al., 1980, Am J Pathol 98(l):281-4). 
20 Examples of animal models for cirrhosis include, but are not limited to, 

ecu-exposed rats (see. e.g., Kloehn et al., 2001. Horm Metab Res 33(7):394-401) and 
rodent models instigated by bacterial cell components or colitis (see. e.g., Vierling. 2001. 
Best Pract Res Clin Gastroenterol 15(4):591-610). 

Examples of animal models for hemophilia include, but are not hmited to, 
25 rodent models for hemophilia A (see. e.g., Reipert et al., 2000, Thromb Haemost 84(5):826- 
32; Jarvis et al.,. 1996. Thromb Haemost 75(2):318-25; and Bi et al., 1995, Nat Genet 
10(1): 119-21). canine models for hemophilia A (see. e.g., Gallo-Penn et al., 1999. Hum 
Gene Ther 10(1 1):1791-802 and Connelly et al, 1998, Blood 91(9);3273-81). murine 
models for hemophilia B (see, e.g., Snyder et al., 1 999, Nat Med 5(1):64.70; Wang et al., 
30 1997, Proc Natl Acad Sci USA 94(21):1 1563-6; and Fang et al., 1996, Gene Ther 3(3):217- 
22). canine models for hemophilia B (see, e.g.. Mount et al., 2002, Blood 99(8):2670-6; 
Snyder et al., 1999. Nat Med 5(l):64-70; Fang et al., 1996. Gene Ther 3(3):217-22); and 
Kay et al., 1994. Proc Natl Acad Sci USA 91(6):2353-7), and a rhesus macaque model for 
hemophilia B (see, e.g., Lozier et al., 1999, Blood 93(6):1875-81). 
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Examples of animal models for von Willebrand disease include, but are not 
limited to, an inbred mouse strain RIIIS/J (see, e.g., Nichols et al., 1994, 83(1 1):3225-31 
and Sweeney et al., 1990, 76(1 1):2258-65), rats injected with botrocetin (see, e.g., Sanders 
et al., 1988, Lab Invest 59(4):443-52), and porcine models for von Willebrand disease (see, 
e.g., Nichols et al., 1995, Proc Natl Acad Sci USA 92(7):2455-9; Johnson & Bowie, 1992, J 
Lab Clin Med 120(4):553-8); and Brinkhous et al., 1991, Mayo Clin Proc 66(7):733-42). 

Examples of animal models for b-thalassemia include, but are not limited to, 
murine models with mutations in globin genes (see, e.g., Lewis et al., 1998, Blood 
91(6):2152-6; Raja et al., 1994, Br J Haematol 86(l):156-62; Popp et al., 1985, 445:432-44; 
and Skow et al., 1983, Cell 34(3): 1043-52). Ciavatta and co-workers created a mouse 
model of beta-zero-thalassemia by targeted deletion of both adult beta-like globin genes, 
beta(maj) and beta(min), in mouse embryonic stem cells (see, e.g., Ciavatta et al., 1995, 
Proc Natl Acad Sci U S A. Sep 26;92(20):9259-63). 

Examples of animal models for kidney stones include, but are not limited to, 
genetic hypercalciuric rats (see, e.g., Bushinsky et al., 1999, Kidney Int 55(l):234-43 and 
Bushinsky et al., 1995, Kidney Int 48(6): 1705-13), chemically treated rats (see, e.g., Grases 
et al., 1998, Scand J Urol Nephrol 32(4):261-5; Burgess et al., 1995, Urol Res 23(4):239- 
42; Kumar et al., 1991, J Urol 146(5): 1384-9; Okada et al., 1985, Hinyokika Kiyo 
31(4):565-77; and Bluestone et al., 1975, Lab Invest 33(3):273-9), hyperoxaluric rats (see, 
e.g., Jones et al., 1991, J Urol 145(4):868-74), pigs with unilateral retrograde flexible 
nephroscopy (see, e.g., Seifmah et al., 2001, 57(4):832-6), and rabbits with an obstructed 
upper urinary tract (see, e.g., Itatani et al., 1979, Invest Urol 17(3):234-40). 

Examples of animal models for ataxia-telangiectasia include, but are not 
limited to, murine models of ataxia-telangiectasia (see, e.g.. Barlow et al., 1999, Proc Natl 
Acad Sci USA 96(I7):9915-9 and Inoue et al., 1986. Cancer Res 46(8):3979-82). A mouse 
model was generated for ataxia-telangiectasia using gene targeting to generate mice that did 
not express the Atm protein (see, e.g.. Bison et al., 1996, Proc. Nat. Acad. Sci. 93: 13084- 
13089). 

Examples of animal models for lysosomal storage diseases include, but are 
not limited to, mouse models for mucopolysaccharidosis type VII (see, e.g.. Brooks et al., 
2002, Proc Natl Acad Sci USA. 99(9):62 16-21; Monroy et al.. 2002, Bone 30(2):352-9; 
Vogler et al., 2001 , Pediatr Dev Pathol. 4(5):421-33; Vogler et al., 2001, Pediatr Res. 
49(3):342-8; and Wolfe et al., 2000, Mol Ther. 2(6):552-6), a mouse model for 
metachromatic leukodystrophy (see, e.g.. Matzner et al., 2002, Gene Ther. 9(l):53-63), a 
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mouse model of Sandhoff disease (see, e.g., Sango et al., 2002, Neuropathol Appl 
Neurobiol. 28(l):23-34), mouse models for mucopolysaccharidosis type III A (see, e.g., 
Bhattacharyya et al., 2001, Glycobiology 1 1(1):99-10 and Bhaumik et al, 1999, 
Glycobiology 9(1 2): 1389-96.), arylsulfatase A (ASA)-deficient mice (see. e.g., D'Hooge et 
al., 1999. Brain Res. 847(2):352-6 and D'Hooge et al, 1999. Neurosci Lett. 273(2):93-6); 
mice with an aspartylglucosaminuria mutation (see, e.g., Jalanko et al., 1998, Hum Mol 
Genet. 7(2):265-72); feline models of mucopolysaccharidosis type VI (see. e.g., Crawley et 
al, 1998, J Clin Invest. 101(1):109-19 and Norrdin et al.. 1995, Bone 17(5):485-9); a feline 
model of Niemann-Pick disease type C (see, e.g., March et al, 1997, Acta Neuropathol 
(Bed). 94(2): 164-72); acid sphingomyelinase-deficient mice (see, e.g., Otterbach & Stoffel. 
1995, Cell 8 1 (7): 1053-6), and bovine mannosidosis (see, e.g.. Jolly et al., 1975. Birth 
Defects Orig Artie Ser. 1 1(6):273-8). 

Examples of animal models for tuberous sclerosis ("TSC") include, but are 
not limited to. a mouse model of TSCl (see, e.g., Kwiatkowski et al, 2002, Hum Mol 
Genet. 1 1(5):525-34). a Tscl (TSCl homologue) knockout mouse (see, e.g., Kobayashi et 
al, 2001, Proc NaU Acad Sci USA. 2001 Jul 17;98(15):8762-7). a TSC2 gene 
mutant(Eker) rat model (see, e.g., Hino 2000, Nippon Rinsho 58(6):1255-61; Mizuguchi et 
al, 2000, J Neuropathol Exp Neurol. 59(3):188-9; and Hino et al, 1999. Prog Exp Tumor 
Res. 35:95-108); and Tsc2(+/-) mice (see. e.g., Onda et al., 1999, J Clin Invest 
104(6):687-95). 



5.63. Toxicity 

The toxicity and/or efficacy of a compound identified in accordance with the 
invention can be determined by standard pharmaceutical procedures in cell cultures or 
experimental animals, e.g., for determining the LDso (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). 
Cells and cell lines that can be used to assess the cytotoxicity of a compound identified in 
accordance with the invention include, but are not limited to, peripheral blood mononuclear 
cells (PBMCs), Caco-2 cells, and Huh7 cells. The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio LDso/EDso- A compound 
identified in accordance with the invention that exhibits large therapeutic indices is 
preferred. While a compound identified in accordance with the invention that exhibits toxic 
side effects may be used, care should be taken to design a delivery system that targets such 



-86- 



< ( 

wo 2004/001010 ' PCT/US2003/019760 

agents to the site of affected tissue in order to minimize potential damage to iminfected cells 
and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used 
in formulating a range of dosage of a compound identified in accordance with the invention 
for use in humans. The dosage of such agents lies preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration 
utilized. For any agent used in the method of the invention, the therapeutically effective 
dose can be estimated initially from cell culture assays. A dose may be formulated in 
animal models to achieve a circulating plasma concentration range that includes the IC50 
(i.e., the concentration of the compound that achieves a half-maximal inhibition of 
symptoms) as determined in cell culture. Such information can be used to more accurately 
determine useful doses in humans. Levels in plasma may be measured, for example, by 
high performance liquid chromatography. 

5.7. Desi2n of Congeners or Analogs 

The compounds which display the desired biological activity can be used as 
lead compounds for the development or design of congeners or analogs having usefiil 
pharmacological activity. For example, once a lead compound is identified, molecular 
modeling techniques can be used to design variants of the compound that can be mote 
effective. Examples of molecular modeling systems are the CHARM and QUANTA 
programs (Polygen Corporation, Waltham, MA). CHARM performs the energy 
minimization and molecular dynamics functions. QUANTA performs the construction, 
graphic modelling and analysis of molecular structure. QUANTA allows interactive 
construction, modification, visualization, and analysis of the behavior of molecules with 
each other. 

A number of articles review computer modeling of drugs interactive with 

specific proteins, such as Rotivinen et al., 1988, Acta Pharmaceutical Fennica 97:159-166; 

Ripka, 1998, New Scientist 54-57; McKinaly & Rossmann, 1989, Annu. Rev. Pharmacol. 

Toxiciol. 29:1 1 1-122; Perry & Davies, OSAR: Quantitative Structure-Activity 

Relationships in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis & Dean, 1989, 

Proc. R. Soc. Lond. 236:125-140 and 141-162; Askew et al., 1989, J. Am. Chem. Soc. 

1 1 1 :1082-1090. Other computer programs that screen and graphically depict chemicals are 

available from companies such as BioDesign, Inc. (Pasadena, California), Allelix. Inc. 

(Mississauga, Ontario, Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these 
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are primarily designed for application to drugs specific to particular proteins, they can be 
adapted to design of drugs specific to any identified region. The analogs and congeners can 
be tested for binding to translational machinery using assays well-known in the art or 
described herein for biologic activity. Alternatively, lead compounds with little or no 
biologic activity, as ascertained in the screen, can also be used to design analogs and 
congeners of the compound that have biologic activity. 

5.8. Use of Identified Compounds to Treat/Pr event a Disease or Disorder 

The present invention provides methods of preventing, treating, managing or 
ameUorating a disorder associated with premature translation termination and/or nonsense- 
mediated mRNA decay, or one or more symptoms thereof, said methods comprising 
administering to a subject in need thereof one or more compounds identified in accordance 
with the methods of the invention or a pharmaceutically acceptable salt thereof. Examples 
of diseases associated with, characterized by or caused by associated with premature 
translation termination and/or nonsense-mediated mRNA decay include, but are not limited 
to, cystic fibrosis, muscular dystrophy, heart disease, lung cancer, breast cancer, colon 
cancer, pancreatic cancer, non-Hodgkin's lymphoma, ovarian cancer, esophageal cancer, 
colorectal carcinomas, neurofibromatosis, retinoblastoma, Wilm's tumor, retinitis 
pigmentosa, collagen disorders, cirrhosis, Tay-Sachs disease, blood disorders, kidney 
stones, ataxia-telangiectasia, lysosomal storage diseases, and tuberous sclerosis. See 
Sections 5.5 and 8 for additional non-limiting examples of diseases and genetic disorders 
which can be prevented, treated, managed or ameliorated by administering one or more of 
the compounds identified in accordance with tiie methods of the invention or a 
pharmaceutically acceptable salt tfiereof Genes that contain one or more nonsense 
mutations that are potentially involved in causing disease are presented in table form 
according to chromosome location in Example 8 infra. 

In a preferred embodiment, it is first determined that the patient is suffering 
from a disease associated with premature translation termination and/or nonsense-mediated 
mRNA decay before administering a compound identified in accordance with the invention 
or a combination therapy described herein. In a preferred embodiment, the DNA of the 
patient can be sequenced or subject to Southern Blot, polymerase chain reaction (PGR), use 
of the Short Tandem Repeat (STR), or polymorphic length restriction fragments (RFLP) 
analysis to determine if a nonsense mutation is present in the DNA of the patient. 
Alternatively, it can be determined if altered levels of the protein with the nonsense 
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mutation are expressed in the patient by western blot or other immunoassays. Such methods 
are well known to one of skill in the art. 

In one embodiment, the invention provides a method of preventing, treating, 
managing or ameliorating a disorder or one or more symptoms thereof, said method 
comprising administering to a subject in need thereof a dose of a prophylactically or 
therapeutically effective amount of one or more compounds identified in accordance with 
the methods of the invention. In another embodiment, a compound identified in accordance 
with the methods of the invention is not administered to prevent, treat, or ameliorate a 
disorder or one or more symptoms thereof, if such compound has been used previously to 
prevent, treat, manage or ameliorate said disorder. In a more specific embodiment of the 
invention, disorders that can be treated with the compounds of the invention, include, but 
are not limited to, disorders that are associated with, characterized by or caused by 
premature translation termination and/or nonsense mediated mRNA decay. 

The invention also provides methods of preventing, treating, managing or 
ameliorating a disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense mediate mRNA decay, or one or more symptoms thereof, said 
methods comprising administering to a subject in need thereof one or more of the 
compounds identified utilizing the screening methods described herein or a 
phanmaceutically acceptable salt thereof, and one or more other therapies (e.g., prophylactic 
or therapeutic agents). Preferably, the other therapies are currently being used, have been 
used or are known to be useful in the prevention, treatment, management or amelioration of 
said disorder or a symptom thereof. Non-limiting examples of such therapies are in Section 
5.8.1 infra. 

The therapies (e.g,, prophylactic or therapeutic agents) or the combination 
therapies of the invention can be administered sequentially or concurrently. In a specific 
embodiment, the combination therapies of the invention comprise a compound identified in 
accordance with the invention and at least one other therapy that has the same mechanism of 
action as said compound. In another specific embodiment, the combination therapies of the 
invention comprise a compound identified in accordance with the methods of the invention 
and at least one other therapy (e,g, prophylactic or therapeutic agent) which has a different 
mechanism of action than said compound. The combination therapies of the present 
invention improve the prophylactic or therapeutic effect of a compound of the invention by 
functioning together with the compound to have an additive or synergistic effect. The 
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combination therapies of the present invention reduce the side effects associated with the 
therapies {e.g., prophylactic or therapeutic agents). 

The prophylactic or therapeutic agents of the combination therapies can be 
administered to a subject in the same pharmaceutical composition. Alternatively, the 
prophylactic or therapeutic agents of the combination therapies can be administered 
concurrently to a subject in separate pharmaceutical compositions. The prophylactic or 
therapeutic agents may be administered to a subject by the same or different routes of 
administration. 

In a specific embodiment, a pharmaceutical composition comprising one or 
more compounds identified in a screening assay described herein is administered to a 
subject, preferably a human, to prevent, treat, manage or ameliorate a disorder associated 
with, characterized by or caused by premature translation termination and/or nonsense 
mediated mRNA decay or one or more symptoms thereof In accordance with the 
invention, the phaimaceutical composition may also comprise one or more other 
prophylactic or therapeutic agents. Preferably, such prophylactic or theapeutic agents are 
currently being used, have been used or are known to be useful in the prevention, treatment, 
management or amelioration of a disorder associated with, characterized by, or caused by 
premature translation termination or nonsense-mediated mRNA decay or one or more 
symptoms thereof. 

A compound identified in accordance with the methods of the invention may 
be used as a first, second, third, fourth or fifth line of therapy for a disorder associated with, 
characterized by or caused by premature translation termination and/or nonsense-mediated 
mRNA decay. The invention provides methods for treating, managing or ameliorating a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay or one or more symptoms thereof in a subject 
refi-actory to conventional therapies for such disorder, said methods comprising 
administering to said subject a dose of a prophylactically or therapeutically effective 
amount of a compound identified in accordance with the methods of the invention. In 
particular, a disorder may be determined to be refiractory to a therapy when at least some 
significant portion of the disorder is not resolved in response to the therapy. Such a 
determination can be made either in vivo or in vitro by any method known in the art for 
assaying the effectiveness of a therapy on a subject, using the art-accepted meanings of 
"refractor/' in such a context. In a specific embodiment, a disorder is refractory where the 
number of symptoms of the disorder has not been significantly reduced, or has increased. 
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The invention provides methods for treating, managing or ameliorating one 
or more symptoms of a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay in a subject refractory to 
existing single agent therapies for such disorder, said methods comprising administering to 
said subject a dose of a prophylactically or therapeutically effective amount of a compound 
identified in accordance with the methods of the invention and a dose of a prophylactically 
or therapeutically effective amount of one or more other therapies (e,g,, prophylactic or 
therapeutic agents). The invention also provides methods for treating or managing a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay by administering a compound identified in 
accordance with the methods of the invention in combination with any other therapy (e.g., 
radiation therapy, chemotherapy or surgery) to patients who have proven refractory to other 
therapies but are no longer on these therapies. The invention also provides methods for the 
treatment or management of a patient having a disorder associated with, characterized by or 
caused by premature translation termination and/or nonsense-mediated mRNA decay and 
said patient is inmiunosuppressed by reason of having previously undergone other therapies. 
Further, the invention provides methods for preventing the recurrence of a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay such as, e.g., cancer in patients that have been undergone 
therapy and have no disease activity by administering a compound identified in accordance 
with the methods of the invention. 

5.8.1, Other Therapies 

The present invention provides methods of preventing, treating, managing or 

ameliorating a disorder associated with, characterized by or caused by premature translation 

temiination and/or nonsense-mediated mRNA decay, or one or more symptoms thereof, said 

methods comprising administering to a subject in need thereof one or more compounds 

identified in accordance with the methods of the invention or a pharmaceutically acceptable 

salt thereof, and one or more other therapies (e.g., prophylactic or therapeutic agents). Any 

therapy (e.g., chemotherapies, radiation therapies, hormonal therapies, and/or biological 

therapies/immunotherapies) which is known to be useful, or which has been used or is 

currently being used for the prevention, treatment, management or amelioration of disorders 

associated with, characterized by or caused by premature translation termination and/or 

nonsense-mediated mRNA decay or one or more symptoms thereof can be used in 

combination with a compound identified in accordance with the methods of the invention. 
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Examples of therapeutic or prophylactic agents which can be used in combination with a 
compound identified in accordance with the invention include, but are not limited to, 
peptides, polypeptides, fusion proteins, nucleic acid molecules, small molecules, mimetic 
agents, synthetic drugs, inorganic molecules, and organic molecules. 

Proliferative disorders associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay can be 
prevented, treated, managed or ameliorated by administering to a subject in need thereof 
one or more of the compounds identified in accordance with the methods of the invention, 
and one or more other therapies for prevention, treatment, management or amelioration of 
said disorders or a symptom thereof. Examples of such therapies include, but are not 
limited to, angiogenesis inhibitors, topoisomerase inhibitors, immunomodulatory agents 
(such as chemotherapeutic agents) and radiation therapy. Angiogenesis inhibitors (i.e., anti- 
angiogenic agents) include, but are not limited to, angiostatin (plasminogen fragment); 
antiangiogenic antithrombin 10; angiozyme; ABT-627; Bay 12-9566; Benefin; 
Bevacizumab; BMS-275291; cartilage-derived inhibitor (CDI); CAI; CD59 complement 
fragment; CEP-7055; Col 3; combretastatin A-4; endostatin (collagen XVm fragment); 
fibronectin fragment; Gro-beta; Halofiiginone; Heparinases; Heparin hexasaccharide 
fragment; HMV833; human chorionic gonadotropin (hCG); IM-862; Interferon 
alpha^eta/gamma; Interferon inducible protein (IP-10); Interieukin-12; Kringle 5 
(plasminogen fragment); Marimastat; Metalloproteinase inhibitors (TIMPs); 2- 
methoxyestradiol; MMI 270 (COS 27023A); MoAb IMC-lCll; Neovastat; NM-3; Panzem; 
PI-88; Placental ribonuclease inhibitor; plasminogen activator inhibitor; platelet factor-4 
(PF4); Prinomastat; Prolactin 16kD fragment; Proliferin-related protein (PRP); PTK 
787/ZK 222594; retinoids; solimastat; squalamine; SS 3304; SU 5416; SU6668; SUl 1248; 
tetrahydiDCortisol-S; tetrathiomolybdate; thalidomide; thrombospondin-l (TSP-I); TNP- 
470; transforming growth factor-beta; vasculostatin; vasostatin (calreticulin fragment); 
ZD6126; ZD 6474; famesyl transferase inhibitors (FTI); and bisphosphonates. In a specific 
embodiment, anti-angiogenic agents do not include antibodies or fragments thereof that 
immunospecifically bind to integrin ocv^3- 

Specific examples of propylactic or therapeutic agents which can be used in 
accordance with the methods of the invention to prevent, treat, manage or ameliorate a 
proliferative disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay, or a symptom thereof include, but not 
limited to: acivicin; aclarubicin; acodazole hydrochloride; acronine; adozelesin; aldesleukin; 
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altretamine; ambomycin; ametantrone acetate; aminoglutethimide; amsacrine; anastrozole; 
anthramycin; asparaginase; asperlin; azacitidine; azetepa; azotomycin; batimastat; 
benzodepa; bicalutamide; bisantrene hydrochloride; bisnafide dimesylate; bizelesin; 
bleomycin sulfate; brequinar sodium; bropirimine; busulfan; cactinomycin; calusterone; 
caracemide; carbetimer; carboplatin; carmustine; canibicin hydrochloride; carzelesin; 
cedefingol; chlorambucil; cirolemycin; cisplatin; cladribine; crisnatol mesylate; 
cyclophosphamide; cytarabine; dacarbazine; dactinomycin; daunorubicin hydrochloride; 
decitabine; dexormaplatin; dezaguanine; dezaguanine mesylate; diaziquone; docetaxel; 
doxorubicin; doxorubicin hydrochloride; droloxifene; droloxifene citrate; dromostanolone 
propionate; duazomycin; edatrexate; eflomithine hydrochloride; elsamitrucin; enloplatin; 
enpromate; epipropidine; epirubicin hydrochloride; erbulozole; esorubicin hydrochloride; 
estramustine; estramustine phosphate sodium; etanidazole; etoposide; etoposide phosphate; 
etoprine; fadrozole hydrochloride; fazarabine; fenretinide; floxuridine; fludarabine 
phosphate; fluorouracil; flurocitabine; fosquidone; fostriecin sodium; gemcitabine; 
gemcitabine hydrochloride; hydroxyurea; idarubicin hydrochloride; ifosfamide; ilmofosine; 
interleukin II (including recombinant interleukin H, or rIL2), interferon alpha-2a; interferon 
alpha-2b; interferon alpha-nl ; interferon alpha-n3; interferon beta-I a; interferon gamma-I 
b; iproplatin; irinotecan hydrochloride; lanreotide acetate; letrozole; leuprolide acetate; 
liarozole hydrochloride; lometrexol sodium; lomustine; losoxantrone hydrochloride; 
masoprocol; maytansine; mechlorethamine hydrochloride; megestrol acetate; melengestrol 
acetate; melphalan; menogaril; mercaptopurine; methotrexate; methotrexate sodium; 
metoprine; meturedepa; mitindomide; mitocarcin; mitocromin; mitogillin; mitomalcin; 
mitomycin; mitosper; mitotane; mitoxantrone hydrochloride; mycophenolic acid; 
nocodazole; nogalamycin; ormaplatin; oxisuran; paclitaxel; pegaspargase; peliomycin; 
pentamustine; peplomycin sulfate; perfosfamide; pipobroman; piposulfan; piroxantrone 
hydrochloride; plicamycin; plomestane; porfimer sodium; porfiromycin; prednimustine; 
procarbazine hydrochloride; puromycin; puromycin hydrochloride; pyrazofurin; riboprine; 
rogletimide; safingol; safingol hydrochloride; semustine; simtrazene; sparfosate sodium; 
sparsomycin; spirogermanium hydrochloride; spiromustine; spiroplatin; streptonigrin; 
streptozocin; sulofenur; talisomycin; tecogalan sodium; tegafiir; teloxantrone hydrochloride; 
temoporfin; teniposide; teroxirone; testolactone; thiamiprine; thioguanine; thiotepa; 
tiazofiirin; tirapazamine; toremifene citrate; trestolone acetate; triciribine phosphate; 
trimetrexate; trimetrexate glucuronate; triptorelin; tubulozole hydrochloride; uracil mustard; 
uredepa; vapreotide; verteporfin; vinblastine sulfate; vincristine sulfate; vindesine; 
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vindesine sulfate; yinepidine sulfate; vinglycinate sulfate; vinleurosine sulfate; vinorelbine 
tartrate; vinrosidine sulfate; vinzolidine sulfate; vorozole; zeniplatin; zinostatin; zorubicin 
hydrochloride. Other anti-cancer drugs include, but are not limited to: 20-epi-l,25 
dihydroxyvitamin D3; 5-ethynyluracil; abiraterone; aclarubicin; acylfulvene; adecypenol; 
adozelesin; aldesleukin; ALL-TK antagonists; altretamine; ambamustine; amidox; 
amifostine; aminolevulinic acid; amrubicin; amsacrine; anagrelide; anastrozole; 
andrographolide; angiogenesis inhibitors; antagonist D; antagonist G; antarelix; 
anti-dorsalizing morphogenetic protein-1; antiandrogen, prostatic carcinoma; antiestrogen; 
antineoplaston; antisense oligonucleotides; aphidicolin glycinate; apoptosis gene 
modulators; apoptosis regulators; apurinic acid; ara-CDP-DL-PTB A; arginine deaminase; 
asulacrine; atamestane; atrimustine; axinastatin 1; axinastatin 2; axinastatin 3; azasetron; 
azatoxin; azatyrosine; baccatin m derivatives; balanol; batimastat; BCR/ABL antagonists; 
benzochlorins; benzoylstaurosporine; beta lactam derivatives; beta-alethine; betaclamycin 
B; betulinic acid; bFGF inhibitor, bicalutamide; bisantrene; bisaziridinylspermine; 
bisnafide; bistratene A; bizelesin; breflate; bropirimine; budotitane; buthionine sulfoximine; 
calcipotriol; calphostin C; camptothecin derivatives; canarypox IL-2; capecitabine; 
carboxamide-amino-triazole; carboxyamidotriazole; CaRest M3; CARN 700; cartilage 
derived inhibitor; carzelesin; casein kinase inhibitors (ICOS); castanospermine; cecropin B; 
cetroreUx; chlorlns; chloroquinoxaline sulfonamide; cicaprost; cis-porphyrin; cladribine; 

clomifene analogues; clotrimazole; coUismycin A; coUismycin B; combretastatin A4; 

combretastatin analogue; conagenin; crambescidin 816; crisnatol; cryptophycin 8; 

cryptophycin A derivatives; curacin A; cyclopentanthraquinones; cycloplatam; cypemycin; 

cytarabine ocfosfate; cytolytic factor; cytostatin; dacliximab; decitabine; dehydrodidemnin 

B; deslorelin; dexamethasone; dexifosfamide; dexrazoxane; dexverapamil; diaziquone; 

didemnin B; didox; diethylnorspermine; dihydro-5-azacytidine; dihydrotaxol, 9-; 

dioxamycin; diphenyl spiromustine; docetaxel; docosanol; dolasetron; doxifluridine; 

droloxifene; dronabinol; duocarmycin SA; ebselen; ecomustine; edelfosine; edrecolomab; 

eflomithine; elemene; emitefur; epirubicin; epristeride; estramustine analogue; estrogen 

agonists; estrogen antagonists; etanidazole; etoposide phosphate; exemestane; fadrozole; 

fazarabine; fenretinide; filgrastim; finasteride; flavopiridol; flezelastine; fluasterone; 

fiudarabine; fluorodaunorunicin hydrochloride; forfenimex; formestane; fostriecin; 

fotemustine; gadolinium texaphyrin; gallium nitrate; galocitabine; ganirelix; gelatinase 

inhibitors; gemcitabine; glutathione inhibitors; hepsulfam; heregulin; hexamethylene 

bisacetamide; hypericin; ibandronic acid; idarubicin; idoxifene; idramantone; ilmofosine; 
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ilomastat; imidazoacridones; imiquimod; immunostimulant peptides; insulin-like growth 
factor-] receptor inhibitor; interferon agonists; interferons; interleukins; iobenguane; 
iododoxorubicin; ipomeanol, 4-; iroplact; irsogladine; isobengazole; isohomohalicondrin B; 
itasetron; jasplakinolide; kahalalide F; lamellarin-N triacetate; lanreotide; leinamycin; 
lenograstim; lentinan sulfate; leptolstatin; letrozole; leukemia inhibiting factor; leukocyte 
alpha interferon; leuprolide+estrogen+progesterone; leuprorelin; levamisole; liarozole; 
linear polyamine analogue; lipophilic disaccharide peptide; lipophilic platinum compounds; 
lissoclinamide 7; lobaplatin; lombricine; lometrexol; lonidamine; losoxantrone; lovastatin; 
loxoribine; lurtotecan; lutetium texaphyrin; lysofylline; lytic peptides; maitansine; 
mannostatin A; marimastat; masoprocol; maspin; matrilysin inhibitors; matrix 
metalloproteinase inhibitors; menogaril; merbarone; meterelin; methioninase; 
metoclopramide; MIF inhibitor; mifepristone; miltefosine; mirimostim; mismatched double 
stranded RNA; mitoguazone; mitolactol; mitomycin analogues; mitonafide; mitotoxin 
fibroblast growth factor-saporin; mitoxantrone; mofarotene; molgramostim; monoclonal 
antibody, human chorionic gonadotrophin; monophosphoryl lipid A+myobacterium cell 
wall sk; mopidamol; multiple dmg resistance gene inhibitor; multiple tumor suppressor 
1 -based therapy; mustard anticancer agent; mycaperoxide B; mycobacterial cell wall 
extract; rayriaporone; N-acetyldinaline; N-substituted benzamides; nafarelin; nagrestip; 
naloxone+pentazocine; napavin; naphterpin; nartograstim; nedaplatin; nemorubicin; 
neridronic acid; neutral endopeptidase; nilutamide; nisamycin; nitric oxide modulators; 
nitroxide antioxidant; nitrullyn; 06-benzylguanine; octreotide; okicenone; oligonucleotides; 
onapristone; ondansetron; ondansetron; oracin; oral cytokine inducer; ormaplatin; 
osaterone; oxaliplatin; oxaunomycin; paciitaxel; paclitaxel analogues; paclitaxel derivatives; 
palauamine; pahnitoylrhizoxin; pamidronic acid; panaxytriol; panomifene; parabactin; 
pazelliptine; pegaspargase; peldesine; pentosan polysulfate sodium; pentostatin; pentrozole; 
perflubron; perfosfamide; perillyl alcohol; phenazinomycin; phenylacetate; phosphatase 
inhibitors; picibanil; pilocarpine hydrochloride; pirarubicin; piritrexim; placetin A; placetin 
B; plasminogen activator inhibitor; platinum complex; platinum compounds; 
platinum-triamine complex; porfimer sodium; porfiromycin; prednisone; propyl 
bis-acridone; prostaglandin J2; proteasome inhibitors; protein A-based immune modulator; 
protein kinase C inhibitor; protein kinase C inhibitors, microalgal; protein tyrosine 
phosphatase inhibitors; purine nucleoside phosphorylase inhibitors; purpurins; 
pyrazoloacridine; pyridoxylated hemoglobin polyoxyethylene conjugate; raf antagonists; 
raltitrexed; ramosetron; ras famesyl protein transferase inhibitors; ras inhibitors; ras-GAP 
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inhibitor; retelliptine demethylated; rhenium Re 186 etidronate; rhizoxin; ribozymes; RII 
retinamide; rogletimide; rohitukine; romurtide; roquinimex; rubiginone Bl; ruboxyl; 
safingol; saintopin; SarCNU; sarcophytol A; sargramostim; Sdi 1 mimetics; semustine; 
senescence derived inhibitor 1; sense oligonucleotides; signal transduction inhibitors; signal 
transduction modulators; single chain antigen binding protein; sizofiran; sobuzoxane; 
sodium borocaptate; sodium phenylacetate; solverol; somatomedin binding protein; 
sonermin; sparfosic acid; spicamycin D; spiromustine; splenopentin; spongistatin 1; 
squalamine; stem cell inhibitor; stem-cell division inhibitors; stipiamide; stromelysin 
inhibitors; sulfmosine; superactive vasoactive intestinal peptide antagonist; suradista; 
suramin; swainsonine; synthetic glycosaminoglycans; tallimustine; 5-fluorouracil; 
leucovorin; tamoxifen methiodide; tauromustine; tazarotene; tecogalan sodium; tegafur; 
tellurapyrylium; telomerase inhibitors; temoporfin; temozolomide; teniposide; 
tetrachlorodecaoxide; tetrazomine; thaliblastine; thiocoraline; thrombopoietin; 
thrombopoietin mimetic; thymalfasin; thymopoietin receptor agonist; thymotrinan; thyroid 
stimulating hormone; tin ethyl etiopuipurin; tirapazamine; titanocene bichloride; topsentin; 
toremifene; totipotent stem cell factor; translation inhibitors; tretinoin; triacetyluridine; 
triciribine; trimetrexate; triptorelin; tropisetron; turosteride; tyrosine kinase inhibitors; 
tyrphostins; UBC inhibitors; ubenimex; urogenital sinus-derived growth inhibitory factor; 
urokinase receptor antagonists; vapreotide; variolin B; vector system, erythrocyte gene 
therapy; thalidomide; velaresol; veramine; verdins; verteporfin; vinorelbine; vinxaltine; 
vorozole; zanoterone; zeniplatin; zilascorb; and zinostatin stimalamer. 

Specific examples of propylactic or therapeutic agents which can be used in 
accordance with the methods of the invention to prevent, treat, manage and/or ameliorate a 
central nervous system disorders associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay, or a symptom thereof 
include, but are not limited to: Levodopa, L-DOPA, cocaine, Of-methyl-tyrosine, reserpine, 
tetrabenazine, benzotropine, pargyline, fenodolpam mesylate, cabergoline, pramipexole 
dihydrochloride, ropinorole, amantadine hydrochloride, selegiline hydrochloride, carbidopa, 
pergolide mesylate, Sinemet CR, or Symmetrel. 

Specific examples of propylactic or therapeutic agents which can be used in 
accordance with the methods of the invention to prevent, treat, manage and/or ameliorate a 
metabolic disorders associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay, or a symptom thereof include, but are 
not limited to: a monoamine oxidase inhibitor (MAO), for example, but not limited to, 
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iproniazid, clorgyline, phenelzine and isocarboxazid; an acetylcholinesterase inhibitor, for 
example, but not limited to, physostigmine saliclate, physostigmine sulfate, physostigmine 
bromide, meostigmine bromide, neostigmine methylsulfate, ambenonim chloride, 
edrophonium chloride, tacrine, pralidoxime chloride, obidoxime chloride, trimedoxime 
bromide, diacetyl monoxim, endrophonium, pyridostigmine, and demecarium; an anti- 
inflammatory agent, including, but not limited to, naproxen sodium, diclofenac sodium, 
diclofenac potassium, celecoxib, sulindac, oxaprozin, diflunisal, etodolac, meloxicam, 
ibuprofen, ketoprofen, nabumetone, refecoxib, methotrexate, leflunomide, sulfasalazine, 
gold salts, RHo-D Immune Globulin, mycophenylate mofetil, cyclosporine, azathioprine, 
tacrolimus, basiliximab, daclizimiab, salicylic acid, acetylsalicylic acid, methyl salicylate, 
diflunisal, salsalate, olsalazine, sulfasalazine, acetaminophen, indomethacin, sulindac, 
mefenamic acid, meclofenamate sodium, tolmetin, ketorolac, dichlofenac, flurbinprofen, 
oxaprozin, piroxicam, meloxicam, ampiroxicam, droxicam, pivoxicam, tenoxicam, 
phenylbutazone, oxyphenbutazone, antipyrine, aminopyrine, apazone, zileuton, 
aurothioglucose, gold sodium thiomalate, auranofin, methotrexate, colchicine, allopurinol, 
probenecid, sulfinpyrazone and benzbromarone or betamethasone and other glucocorticoids; 
an antiemetic agent, for example, but not limited to, metoclopromide, domperidone, 
prochlorperazine, promethazine, chlorpromazine, trimethobenzamide, ondansetron, 
granisetron, hydroxyzine, acetylleucine monoethanolamine, alizapride, azasetron, 
benzquinamide, bietanautine, bromopride, buclizine, clebopride, cyclizine, dimenhydrinate, 
diphenidol, dolasetron, meclizine, methallatal, metopimazine, nabilone, oxypemdyl, 
pipamazine, scopolamine, sulpiride, tetrahydrocannabinol, thiethylperazine, 
thioproperazine, tropisetron, and mixtures thereof. 

5.9. Compounds and Methods of Administering Compounds 

Biologically active compounds identified using the methods of the invention 
or a pharmaceutical ly acceptable salt thereof can be administered to a patient, preferably a 
mammal, more preferably a human, suffering from a disorder associated with, characterized 
by or caused by premature translation termination and/or nonsense mediated mRNA decay. 
In a specific embodiment, a compound or a pharmaceutically acceptable salt thereof is 
administered to a patient, preferably a mammal, more preferably a human, as a preventative 
measure against a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay. 

When administered to a patient, the compound or a pharmaceutically 

acceptable salt thereof is preferably administered as component of a composition that 
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optionally comprises a pharmaceutically acceptable vehicle. The composition can be 
administered orally, or by any other convenient route, for example, by infusion or bolus 
injection, by absorption through epithelial or mucocutaneous linings {e.g., oral mucosa, 
rectal, and intestinal mucosa, etc.) and may be administered together with another 
biologically active agent. Administration can be systemic or local. Various delivery 
systems are known, e.g., encapsulation in liposomes, microparticles, microcapsules, 
capsules, etc., and can be used to administer the compound and pharmaceutically acceptable 
salts thereof. 

Methods of administration include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, oral, 
sublingual, intranasal, intracerebral, intravaginal, transdermal, rectally, by inhalation, or 
topically, particularly to the ears, nose, eyes, or skin. The mode of administration is left to 
the discretion of the practitioner. In most instances, administration will result in the release 
of the compound or a pharmaceutically acceptable salt thereof into the bloodstream. 

In specific embodiments, it may be desirable to administer the compound or 
a pharmaceutically acceptable salt thereof locally. This may be achieved, for example, and 
not by way of limitation, by local infusion during surgery, topical application, e.g., in 
conjunction with a wound dressing after surgery, by injection, by means of a catheter, by 
means of a suppository, or by means of an implant, said implant being of a porous, non- 
porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers. 

In certain embodiments, it may be desirable to introduce the compound or a 
pharmaceutically acceptable salt thereof into the central nervous system by any suitable 
route, including intraventricular, intrathecal and epidural injection. Intraventricular 
injection may be facilitated by an intraventricular catheter, for example, attached to a 
reservoir, such as an Ommaya reservoir. 

Puhnonary administration can also be employed, e.^., by use of an inhaler or 
nebulizer, and formulation with an aerosolizing agent, or via perfusion in a fluorocarbon or 
synthetic pulmonary surfactant. In certain embodiments, the compound and 
pharmaceutically acceptable salts thereof can be formulated as a suppository, with 
traditional binders and vehicles such as triglycerides. 

In another embodiment, the compound and pharmaceutically acceptable salts 
thereof can be delivered in a vesicle, in particular a liposome (see Langer, 1990, Science 
249:1527-1533; Treat et al., in Liposomes in the Therapy of Infectious Disease and Cancer, 
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Lopez-Berestein and Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, 
ibid.^ pp. 31 7-327; see generally ibid.). 

In yet another embodiment, the compound and pharmaceutically acceptable 
salts thereof can be delivered in a controlled release system (see, e.g., Goodson, in Medical 
Applications of Controlled Release, supra, vol. 2, pp. 1 15-138 (1984)). Other controUed- 
release systems discussed in the review by Langer, 1990, Science 249:1527-1533 may be 
used. In one embodiment, a pump may be used (see Langer, supra\ Sefton, 1987, CRC Crit. 
Ref. Biomed. Eng. 14:201; Buchwald et al., 1980, Surgery 88:507; Saudek et al., 1989, N. 
EngL J. Med. 321 :574). In another embodiment, polymeric materials can be used (see 
Medical Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca 
Raton, Florida (1974); Controlled Drug Bioavailability, Drug Product Design and 
Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger and Peppas, 1983, 
J. Macromol. Sci. Rev. Macromol. Chem. 23:61; see also Levy et al., 1985, Science 
228:190; During et al., 1989, Ann. Neurol. 25:351; Howard et al., 1989, J. Neurosurg. 
71 : 105). In yet another embodiment, a controlled-release system can be placed in proximity 
of a target RNA of the compound or a pharmaceutically acceptable salt thereof, thus 
requiring only a fraction of the systemic dose. 

Compositions comprising the compound or a pharmaceutically acceptable 
salt thereof ("compound compositions") can additionally comprise a suitable amount of a 
pharmaceutically acceptable vehicle so as to provide the form for proper administration to 
the patient. 

In a specific embodiment, the temi "phamiaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, mammals, 
and more particularly in humans. The term "vehicle" refers to a diluent, adjuvant, 
excipient, or earner with which a compound of the invention is administered. Such 
pharmaceutical vehicles can be liquids, such as water and oils, including those of petroleum, 
animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil 
and the like. The pharmaceutical vehicles can be saline, gum acacia, gelatin, starch paste, 
talc, keratin, colloidal silica, urea, and the like. In addition, auxiliary, stabilizing, 
thickening, lubricating and coloring agents may be used. When administered to a patient, 
the pharaiaceutically acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 
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particularly for injectable solutions. Suitable pharmaceutical vehicles also include 
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostcarate, talc, sodium chloride, dried skim milk, 
glycerol, propylene, glycol, water, ethanol and the like. Compound compositions, if 
desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering 
agents. 

Compound compositions can take the form of solutions, suspensions, 
emulsion, tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained- 
release formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other 
form suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a 
capsule (see e.g., U.S. Patent No. 5.698,155). Other examples of suitable pharmaceutical 
vehicles are described in Remington's Pharmaceutical Sciences. Alfonso R. Gennaro, ed.. 
Mack Publishing Co. Easton,PA. 19th ed., 1995, pp. 1447 to 1676, incorporated herein by 
reference. 

In a preferred embodiment, the compound or a pharaiaceutically acceptable 
salt thereof is formulated in accordance with routine procedures as a pharmaceutical 
composition adapted for oral administration to human beings. Compositions for oral 
delivery may be in the form of tablets, lozenges, aqueous or oily suspensions, granules, 
powders, emulsions, capsules, syrups, or elixirs, for example. Orally administered 
compositions may contain one or more agents, for example, sweetening agents such as 
fructose, aspartame or saccharin; flavoring agents such as peppermint, oil of wintergreen, or 
cherry; coloring agents; and preserving agents, to provide a pharmaceutically palatable 
preparation. Moreover, where in tablet or pill form, the compositions can be coated to delay 
disintegration and absorption in the gastrointestinal tract thereby providing a sustained 
action over an extended period of time. Selectively permeable membranes surrounding an 
osmotically active driving compound are also suitable for orally administered compositions. 
In these later platforms, fluid from the environment surrounding the capsule is imbibed by 
the driving compound, which swells to displace the agent or agent composition through an 
aperture. These delivery platforms can provide an essentially zero order delivery profile as 
opposed to the spiked profiles of immediate release formulations. A time delay material 
such as glycerol monostcarate or glycerol stearate may also be used. Oral compositions can 
include standard vehicles such as mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. Such vehicles are preferably of 
pharmaceutical grade. Typically, compositions for intravenous administration comprise 
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Sterile isotonic aqueous buffer. Where necessary, the compositions may also include a 
solubilizing agent. 

In another embodiment, the compound or a pharmaceutically acceptable salt 
thereof can be fomiulated for intravenous administration. Compositions for intravenous 
administration may optionally include a local anesthetic such as Ugnocaine to lessen pain at 
the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water-free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the compound or a pharmaceutically acceptable salt thereof 
is to be administered by infusion, it can be dispensed, for example, with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the compound or a 
pharmaceutically acceptable salt thereof is administered by injection, an ampoule of sterile 
water for injection or saline can be provided so that the ingredients may be mixed prior to 
administration. 

The amount of a compound or a pharmaceutically acceptable salt thereof that 
will be effective in the treatment of a particular disease will depend on the nature of the 
disease, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. The 
precise dose to be employed will also depend on the route of administration, and the 
seriousness of the disease, and should be decided according to the judgment of the 
practitioner and each patient's circumstances. However, suitable dosage ranges for oral 
administration are generally about 0.001 milligram to about 500 milligrams of a compound 
or a pharmaceutically acceptable salt thereof per kilogram body weight per day. In specific 
preferred embodiments of the invention, the oral dose is about 0.01 milligram to about 100 
milligrams per kilogram body weight per day, more preferably about 0.1 milligram to about 
75 milligrams per kilogram body weight per day, more preferably about 0.5 milligram to 5 
milligrams per kilogram body weight per day. The dosage amounts described herein refer 
to total amounts administered; that is, if more than one compound is administered, or if a 
compound is administered with a therapeutic agent, then the preferred dosages correspond 
to the total amount administered. Oral compositions preferably contain about 10% to about 
95% active ingredient by weight. 

Suitable dosage ranges for intravenous (i.v.) administration are about 0.01 
milligram to about 100 milligrams per kilogram body weight per day, about 0.1 milligram 
to about 35 milligrams per kilogram body weight per day, and about 1 milligram to about 
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10 milligrams per kilogram body weight per day. Suitable dosage ranges for intranasal 
administration are generally about O.Ol pg/kg body weight per day to about 1 mg/Tcg body 
weight per day. Suppositories generally contain about 0.01 milligram to about 50 
milligrams of a compound of the invention per kilogram body weight per day and comprise 
active ingredient in the range of about 0.5% to about 10% by weight. 

Recommended dosages for intradermal, intramuscular, intraperitoneal, 
subcutaneous, epidural, sublingual, intracerebral, intravaginal, transdermal administration or 
administration by inhalation are in the range of about 0.001 milligram to about 200 
milligrams per kilogram of body weight per day. Suitable doses for topical administration 
are in the range of about 0.001 milligram to about 1 milligram, depending on the area of 
administration. Effective doses may be extrapolated from dose-response curves derived 
from in vitro or animal model test systems. Such animal models and systems are well 
known in the art. 

The compound and pharmaceutically acceptable salts fliereof are preferably 
assayed in vitro and in vivo, for the desired therapeutic or prophylactic activity, prior to use 
in humans. For example, in vitro assays can be used to determine whether it is preferable to 
administer the compound, a pharmaceutically acceptable salt thereof, and/or another 
therapeutic agent. Animal model systems can be used to demonstrate safety and efficacy. 

6 EXAMPLE: PREPARATION OF EXTRACTS FROM HELACELLS FOR 

TNVITRO TRANSLATION REACTIONS 

This Example describes a method of preparing a cell extract to perform in 
vitro translation reactions to monitor nonsense suppression or to produce proteins in vitro. 
This method is different from other methods used to prepare translation extracts for several 
reasons. First, the centrifugation step is performed at low speed (12000Xg) compared to 
most other protocols that use a 100,000Xg spin; and second, the cells are incubated on ice 
for several hours to weeks, which increases the activity of the extract significantly. 

6.1. Preparation of Translation Extract from Hela Cells 

HeLa S3 cells were grown to a density of 10^ cells/ml in DMEM; 
5%C02, 10% FBS, IX P/S in a spinner flask. Cells were harvested by spinning at lOOOXg. 
Cells were washed twice with phosphate buffered saline. The cell pellet sat on ice for 12 to 
24 hours before proceeding. By letting the cells sit on ice, the activity of the extract is 
increased two-fold. The length of time on ice can range from 0 hours to 1 week. The cells 
were resuspended in 1.5 volumes (packed cell volume) of hypotonic buffer (10 mM HEPES 
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(KOH) pH 7.4; 15 miM KCl; 1.5 mM Mg(OAc)2; 0.5 mM Pefabloc (Roche); 2 mM DTT). 
Cells were allowed to swell for 5 minutes on ice and dounce homogenized with 10 to 100 
strokes using a tight-fitting pestle. The cells were spun for 10 minutes at 12000Xg at 4X in 
a Sorvall SS-34 rotor. The supernatant was carefully collected with a Pasteur pipet without 
disturbing the lipid layer and transferred into Eppendorf tubes (50-200 mL aliquots) and 
immediately frozen in liquid nitrogen. Figure 1 shows the amount of wild-type luciferase 
produced in an in vitro translation reaction when the amount of wild-type luciferase RNA 
and the amount of HeLa cell extract are varied. Figure 2 shows the amount of wild-type 
luciferase produced in an in vitro translation reaction when the amount of luciferase RNA 
containing the nonsense mutation RGA and the amount of the cell are varied. 

6.2 Incubating Ceils on Ice Improves Translation Activity of Extract 

As shown in Figure 3, incubating cells on ice prior to preparation of the 
translation extract improves the translation activity up to 20 fold. Further, in the presence of 
the aminoglycoside gentamicin, nonsense suppression activity (as measured by the amount 
of luciferase activity produced from a luciferase RNA containing a UGA premature 
termination codon) is increased 2 to 3 fold above untreated extracts (see Figure 4). These 
results demonstrate that extracts prepared by this method actively translate wild type RNA 
as well as mediate nonsense suppression. 

?• EXAMPLE: IDENTIFICATION AND CHARACTERIZATION OF 

COMPOUND THAT PROMOTE NONSENSE SUPPRESSION 
AND/OR MODULATE TRANSLATION TERMINATION 

7.1. Development of Assays for High Throughput Screens 

Two assays were developed for use in high throughput screens to identify 
small molecules that promote nonsense suppression. Each assay utilized luciferase because 
it is a functional reporter gene assay (light is only produced if the protein is functional) and 
it is extremely sensitive (Light intensity is proportional to luciferase concentration in the nM 
range). The first assay was a cell-based luciferase reporter assay and the second was a 
biochemical assay consisting of rabbit reticulocyte lysate and a nonsense-containing 
luciferase reporter mRNA. In the cell-based assay, a luciferase reporter construct 
containing a UGA premature termination codon was stably transfected in 293T Human 
Embryonic Kidney cells, hi the biochemical assay, mRNA containing a UGA premature 
termination codon was used as a reporter in an in vitro translation reaction using rabbit 
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reticulocyte lysate supplemented with tRNA, hemin, creatine kinase, amino acids, KOAc, 
Mg(OAc)2, and creatine phosphate. Translation of the mRNA was initiated within a virus 
derived leader sequence, which significantly reduced the cost of the assay because capped 
RNA was not required. Synthetic mRNA was prepared in vitro using the T7 promoter and 
the MegaScript in vitro transcription kit (Ambion). In both of the biochemical and 
cell-based assays, addition of gentamicin, a small molecule known to allow readthrough of 
premature termination codons, resulted in increased luciferase activity and was, therefore, 
used as an internal standard. 

7,2. Screening of a Chemical Library Using the Nons ense Suppression Assays 

The assays described above in Section 7.1 were used in two high throughput 
screens. Approximately eight hundred thousand compounds were screened in the cell-based 
and biochemical assays. From these initial screens two hundred hits were retested with both 
luciferase assays and seven compounds were subsequently selected for further investigation. 
These compounds fall into four classes of scaffolds. One class of compound is a nucleoside 
analog; the second class is a quinazoline compound; the third class is an oxadiazole 
compound similar to diarylfuran antibiotics; and the final class is a unique scaffold 
harboring one or more phenyl, amide, or similar fiinctional groups. Interestingly, none of 
the compounds are similar in structure to gentamicin. Compound A (molecular formula 
C19H21NO4), a member of the fourth class and Compound B (molecular formula 
C19H18N2O4), a compound synthesized independently of the screen, because of its potential 
RNA binding properties were the focus of subsequent attention. 

7.3. Compound A and Compound B Increase 
In Vitro Nonsense Suppression at UGA Codons 

Based on the results of the high throughput screen. Compound A was 
characterized further with the in vitro luciferase nonsense suppression assay. To ensure that 
the observed nonsense suppression activity of the selected compounds was not limited to the 
rabbit reticulocyte assay system, HeLa cell extract was prepared and optimized (Lie & 
Macdonald, 1999, Development 1 26(22) :4989-4996 and Lie & Macdonald, 2000, Biochem. 
Biophys. Res. Commun. 270(2):473-481). Figure 5 shows that Compound A and 
Compound B exhibit greater nonsense suppression activity of the UGA codon than 
gentamicin in the HeLa cell translation extracts. 

7.4. Characterization of Compounds That Increase 
Nonsense Suppression and Product Functio n Protein 
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Compound A and Compound B increase the level of nonsense suppression in 
the biochemical assay three to four fold over untreated extracts. To determine whether 
these compounds also function in v/vo, a stable cell line harboring the UGA 
nonsense-containing luciferase gene was treated with each compound. Cells were grown in 
5 standard medium supplemented with 1% penicillin-streptomycin (P/S) and 10% fetal bovine 
serum (FBS) to 70% confluency and split 1:1 the day before treatment. On the following 
day, cells were trypsinized and 40,000 cells were added to each well of a 96-well tissue 
culture dish. Serial dilutions of each compound were prepared to generate a six-point dose 
response curve spanning 2 logs (30/iM to 0.3 /xM). The final concentration of the DMSO 
10 solvent remained constant at 1% in each well. Cells treated with 1% DMSO served as the 
background standard, and cells treated with gentamicin served as a positive control. As 
shown in Figure 6, these two compounds are more potent and efficacious than gentamicin at 
these concentrations. 

Cells were transiently transfected with plasmids harboring the UGA, UAA or 
15 UAG nonsense alleles of luciferase in each codon context (UGAA, UGAC, UGAG, UGAU, 
UAGA, UAGC, UAGG, UAGU, UAAA, UAAC, UAAG, and UAAU) then the cells were 
treated overnight with Compound A, and gentamicin. The following day, the level of 
suppression was determined by measuring the amount of luminescence produced. The fold 
suppression above control cells treated with solvent was calculated and is numerically 
20 reported. The results are presented in Table 2 and Figure 6B. 



Table 2 


Context 


Gentamicin 


Compound A 




Smg/ml 


lOuM 


UAAA 


0.71 


0.17 


UAAC 


2.32 


0.67 


UAAG 


0.01 


0.02 


UAAU 


0.92 


0.33 


UAGA 


1.31 


0.64 


UAGC 


2.16 


3.05 


UAGG 


0.64 


0.51 


UAGU 


0.54 


0.31 


UGAA 


0.76 


0.4 


UGAC 


1.91 


2.96 


UGAG 


0.45 


0.23 


UGAU 


6.74 


1.67 



7.5. Compound A Alters the Accessibility of the Chemical 
Modifying Agents to Specific Nucleotides in the 28S rRNA 
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Previous studies have demonstrated that gentamicin and other members of 
the aminoglycoside family that decrease the fidelity of translation bind to the A site of the 
16S rRNA. By chemical footprinting, UV cross-linking and NMR. gentamicin has been 
shown to bind at the A site (comprised of nucleotides 1400-1410 and 1490-1500, £. coli 
numbering) of the rRNA at nucleotides 1406, 1407, 1494, and 1496 (Moazed & Noller, 
1987, Nature 327(612 1):389-394; Woodcock et at., 1991, EMBO J. 10(10):3099-3103; and 
Schroeder et al, 2000). These observations prompted us to determine whether similar 
experiments could provide inforaiation on the mechanism of action of Compound A. To do 
this, ribosomes prepared from HeLa cells were incubated with the small molecules (at a 
concentration of 100 /iM), followed by treatment with chemical modifying agents (dimethyl 
sulfate [DMS] and kethoxal [KE]). Following chemical modification, rRNA was 
phenol-chloroform extracted, ethanol precipitated, analyzed in primer extension reactions 
using end-labeled oligonucleotides hybridizing to different regions of the three rRNAs and 
resolved on 6% polyacrylamide gels. The probes used for primer extension cover the entire 
IBS (7 oligonucleotide primers), 28S (24 oligonucleotide primers), and 5S (one primer) 
rRNAs. Controls in these experiments include DMSO (a control for changes in rRNA 
accessibility induced by DMSO), paromomycin (a marker for IBS rRNA binding), and 
anisomycin (a marker for 28S rRNA binding). 

The results of these foot-printing experiments indicated that Compound A 
alters the accessibility of the chemical modifying agents to specific nucleotides in the 28S 
rRNA. More specifically, the regions protected by Compound A include: (1) a conserved 
region in the vicinity of the peptidyl transferase center (domain V) implicated in peptide 
bond formation (see Figure 7A) and (2) a conserved region in domain II that may interact 
with the peptidyl transferase center based on binding of vemamycinin B to both these areas 
(Vannuffel et al, 1994, Nucleic Acids Res. 22(21):4449-4453; see Figure 7B). 

7.6. Compound A Causes Readthrough of 
Premature Termination Codons in Cell-b ased Disease Models 

To address the effects of the nonsense-suppressing compounds on mRNAs 
altered in specific inherited diseases, a bronchial epithelial cell line harboring a nonsense 
codon at amino acid 1282 (W1282X) was treated with Compound A (20/tM) and CFTR 
fimction was monitored as a cAMP-activated chloride channel using the SPQ assay (Yang 
et al, 1993, Hum Mol Genet. 2(8):1253-1261 and Howard et al., 1996, Nat Med. 
2(4):467-469). These experiments showed that cAMP treatment of these cells resulted in an 
increase in SPQ fluorescence, consistent with stimulation of CFTR-mediated halide efflux 
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(Figure 8). No increase in fluorescence was observed when cells were not treated with 
compound or if the cells were not stimulated with cAMP. These results indicate that the 
full-length CFTR expressed from this nonsense-containing allele following compound 
treatment also functions as a cAMP-stimulated anion channel, thus demonstrating that 
cystic fibrosis cell lines increase chloride channel activity when treated with Compound A. 

7.7. Primary Cells from the mdx Nonsense-containing Mouse 
Express Full -length Dystrophin Protein When Treated with Compound A 

The mutation in the mdx mouse that premature termination of the 427 kDa 
dystrophin polypeptide has been shown to be a C to T transition at position 3185 in exon 23 
(Sicinski et ai, 1989, Science. 244(491 2): 1578-1 580). Mouse primary skeletal muscle 
cultures derived from 1-day old mdx mice were prepared as described previously 
(Barton-Davis et aL, 1999, J Clin Invest. 104(4):375-381), Cells were cultured for 10 days 
in the presence of Compound A (20 ^iM). Culture medium was replaced every four days 
and the presence of dystrophin in myoblast cultures was detected by immunostaining as 
described previously (Barton-Davis et al, 1999, J Clin Invest. 104(4):375-381). A primary 
monoclonal antibody to the C-terminus of the dystrophin protein (F19A12) was used 
undiluted and rhodamine conjugated anti-mouse IgG was used as the secondary antibody. 
The F19A12 antibody will detect the full-length protein produced by suppression of the 
nonsense codon. Staining was viewed using a Leica DMR micropscope, digital camera, and 
associated imaging software at the University of Pennsylvania. As shown in Figure 9, 
full-length dystrophin protein is produced and localized to the muscle myotubes in cultures 
treated with 20 Compound A and gentamicin (200 /xM). In addition, cells from 
untreated cultures exhibited minimal staining. These resuhs indicate that full-length 
dystrophin protein is produced as a consequence of nonsense suppression. 

7.8. Compound A and Compound B Cause Readthrough of 
Premature Termination Codons in the mdx Mouse 

Since the results of the mdx cell culture experiments demonstrated 
production of full-length dystrophin in cells treated with Compound A, it was asked 
whether suppression of the nonsense codon in the mdx mouse could be observed. As 
previously described (Barton-Davis et aL, 1999, J Clin Invest. 104(4):375-381), compound 
was delivered by Alzet osmotic pumps implanted under the skin of anesthetized mice. Two 
doses of Compound A were administered. Gentamicin served as a positive control and 
pumps filled with solvent only served as the negative control. Pumps were loaded with 
appropriate compound such that the calculated doses to which tissue was exposed were 10 
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/iM and 20 ^M. The gentamicin concentration was calculated to achieve tissue exposure of 
approximately 200 iiM. In the initial experiment, mice were treated for 14 days, after which 
animals were anesthetized with ketamine and exsanguinated. The tibialis anterior (TA) 
muscle of the experimental animals was then excised, frozen, and used for 
immunoftuorescence analysis of dystrophin incorporation into striated muscle. The 
presence of dystrophin in TA muscles was detected by immunostaining, as described 
previously (Barton-Davis et ai, 1999, J Clin Invest. 104(4):375-381; see mdx primary cells 
in Section 7.8 supra). As shown in Figure 10. these experiments demonstrated that mice 
treated with both concentrations of compound elicited production of full-length dystrophin. 
Importantly, a significant portion of the full-length dystrophin protein was properly 
localized to the membrane. These important results demonstrate that Compound A can 
fiinction in an animal model. 

8. HUMAN DISF.ASE GENES SORTED BY rHROMOSOME 



Gene 


GDB Accession ID 


ujviiiyi IjIok 


ABCA4 


GDB:370748 


MACULAR DEGENERATION, SENILE 
STARGARDT DISEASE 1; STGDl ATP 
BINDING CASSETTE TRANSPORTER; 
ABCR RETINITIS PIGMENTOSA- 19; 
RP19 


ABCD3 


GDB:131485 


PEROXISOMAL MEMBRANE PROTEIN 
1;PXMP1 


ACADM 


GDB:118958 


ACYL-CoA DEHYDROGENASE, 
MEDIUM-CHAIN; ACADM 


AGL 


GDB:132644 


GLYCOGEN STORAGE DISEASE lU 


AGT 


GDB:118750 


ANGIOTENSIN I; AGT 


ALDH4A1 


GDB:9958827 


HYPERPROLINEMIA, TYPE II 



- 108- 



( ( 

wo 2004/001010 PCT/US2003/019760 



ALPL 


GDB: 118730 


I r- <ln.i B— 1^ liur .^1 iLI,..l ..J |. -Jl. "ilM .1*' lO 

PHOSPHATASE, LIVER ALKALINE; 
ALPL HYPOPHOSPHATASIA, 
INFANTILE 


AMPDl 


GDB: 119677 


ADENOSINE MONOPHOSPHATE 
DEAMINASE- 1 ; AMPD 1 


APOA2 


GDB: 119685 


APOLIPOPROTEIN A-H; APOA2 


AVSDl 


GDB:265302 


ATRIOVENTRICULAR SEPTAL 
DEFFCT- AVSD 


BRCD2 


GDB:9955322 


BREAST CANCER, DUCTAL, 2; BRCD2 


CIQA 


GDB: 11 9042 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, ALPHA 
POLYPEPTIDE; CIQA 


CIQB 


GDB: 119043 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, BETA 
POLYPEPTIDE; CIQB 


CIQG 


GDB:128132 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, GAMMA 
POLYPEPTIDE; CIQG 


C8A 


GDB: 11 9735 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF 


C8B 


GDB: 119736 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF, TYPE U 


CACNAIS 


GDB: 126431 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA IS SUBUNIT; CACNAIS 
PERIODIC PARALYSIS I MALIGNANT 
HYPERTHERMIA SUSCEPTIBILITY-5; 
MHS5 
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ccv 


GDB: 1336655 


CATARACT, CONGENITAL, 
VOLKMANN TYPE; CCV 


CD3Z 


GDB: 119766 


CD3Z ANTIGEN, ZETA POLYPEPTIDE; 
CD3Z 




GDB- 127827 


PROTEIN KINASE o58: PK58 






CHOROIDEREMIA-LIKE: CHML 


CHSl 


GDB:4568202 


CHEDIAK-HIGASHI SYNDROME; CHSl 


CIASl 


GDB:9957338 


COLD HYPERSENSITIVITY 
URTICARIA, DEAFNESS, AND 
AMYLOIDOSIS 








CLCNKB 


GDB:698472 


CHLORIDE CHANNEL. KIDNEY, B; 
CLCNKB 


CMDIA 


GDB:434478 


CARDIOMYOPATHY, DILATED lA; 
CMDIA 


CMH2 


GDB:137324 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC 2- CMH2 


CMM 


GDB: 119059 


MELANOMA. MALIGNANT 


COLllAl 


GDB:120595 


COLLAGEN, TYPE XI, ALPHA-1; 
COLl 1 Al 








COL9A2 


GDB: 1383 10 


COLLAGEN. TYPE DC, ALPHA-2 CHAIN; 
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" 1. II I ri..r niirr -niTi- i-n- r t i 

COL9A2 EPIPHYSEAL DYSPLASIA, 
MULTIPLE, 2; EDM2 


CPT2 


GDB: 127272 


MYOPATHY WITH DEFICIENCY OF 
CARNITINE 

PALMITOYLTRANSFERASE II 
HYPOGLYCEMIA, HYPOKETOTIC, 
WITH DEFICIENCY OF CARNlIINb 
PALMITOYLTRANSFERASE 
CARNITINE 

T> AT TVyfTTOVT TP A'WQ'PFT? AQF TT- PPT9 




LiL/JtS . J J J y J u 










CSE 


GDB:596182 


CHOREOATHETOSIS/SPASTICITY, 
EPISODIC; CSE 


CSF3R 


GDB: 126430 


COLONY STIMULATING FACTOR 3 


CTPA 


GDB:9863168 


CATARACT, POSTERIOR POLAR 


CTSK 


GDB:453910 


PYCNODYSOSTOSIS CATHEPSIN K; 
CTSK 


DBT 


GDB: 118784 


MAPLE SYRUP URINE DISEASE, TYPE 
2 


DIOl 


GDB: 136449 


THYROXINE DEIODINASE TYPE I; 


DISCI 


GDB-9992707 


DISORDER-2- SCZD2 
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DPYD 


GDB:364102 


DIHYDROPYRIMIDINE 
DEHYDROGENASE; DPYD 


EKV 


GDB:n9106 


ERYTHROKERATODERMIA 
VARIABILIS; EKV 


ENOl 


GDB: 119871 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 


ENOIP 


GDB: 135006 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 


EPB41 


GDB: 119865 


ERYTHROCYTE MEMBRANE PROTEIN 
BAND 4.1; EPB41 HEREDITARY 
HEMOLYTIC 


EPHXl 


GDB: 119876 


EPOXIDE HYDROLASE 1, 
MICROSOMAL- EPHXl 


r 1 Jo 




FACTOR Xm B SUBUNIT; F13B 


F5 


GDB: 119896 


FACTOR V DEFICIENCY 


FCGR2A 


GDB: 119903 


Fc FRAGMENT OF IgG, LOW AFFINITY 
Ha, RECEPTOR FOR; FCGR2A 


FCGR2B 


GDB:128183 


Fc FRAGMENT OF IgG, LOW AFFINITY 
na, RECEPTOR FOR; FCGR2A 


FCGR3A 


GDB: 119904 


Fc FRAGMENT OF IgG, LOW AFFINITY 
ma, RECEPTOR FOR; FCGR3 A 


FCHL 


GDB:9837503 


HYPERLIPEDEMIA, COMBINED 
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FH 


GDB:1 19133 


FUMARATE HYDRATASE; FH 
LEIOMYOMATA, HEREDITARY 
MULTIPLE, OF SKIN 


FM03 


GDB:135136 


FLAVDM-CONTAINING 
MONOOXYGENASE 3; FM03 
TRIMETHYLAMnsfURIA 


FM04 


GDB: 127981 


FLAVIN-CONTAINING 


FUCAl 


CjDB:1 19237 


r UCUbli-JUolo 


r Y 


OLll>:l 19242 


ol^UUU ljKLlUi:^--UUrr Y a i o 1 CriVly ry 


A T C 


OLlorl 19243 


CjAI^AC 1 Uoii llr'lMiiKAoil IJlirlUliiiNC^ Y 




OiJxS.l 19202 


O ATTr^XJT70 TMCT? A Ct7 T^VDTJ T- CITS T 








GFND 


GDB:9958222 


GLOMERULAR NEPHRITIS, FAMILL\L, 
WITH FIBRONECTIN DEPOSITS 


GJA8 


GDB:696369 


CATARACT, ZONULAR PULVERULENT 
1 ; CZPl GAP JUNCTION PROTEIN, 
ALPHA.8, 50-KD; GJA8 


GJB3 


GDB:127820 


ERYTHROKERATODERMIA 
VARIABILIS; EKV DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 2; 








GLC3B 


GDB:3801939 


GLAUCOMA 3, PRIMARY INFANTILE, 
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B- GLC3B 


HFl 


GDB: 120041 


H FACTOR 1;HF1 


HMGCL 


GDB: 138445 


HYDROXYMETHYLGLUTARICACIDUR 
lA; HMGCL 


HPCl 


GDB:5215209 


PROSTATE CANCER; PRCAl 
PROSTATE CANCER, HEREDITARY 1 


HRD 


GDB:9862254 


HYPOPARATHYROIDISM WITH SHORT 
STATURE, MENTAL RETARDATION, 
AND SEIZURES 


HRPT2 


GDB: 125253 


HYPERPARATHYROIDISM, FAMILIAL 
PRIMARY, WITH MULTIPLE 
nS^TFYTNG JAW 


HSD3B2 


GDB: 134044 


ADRENAL HYPERPLASIA II 


HSPG2 


GDB: 126372 


HEPARAN SULFATE PROTEOGLYCAN 

MYOTONIC MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND OCULAR 
ANDFACLaiL 


KCNQ4 


GDB:439046 


DEAFNESS. AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 2; 
DFNA2 








KCS 


GDB:9848740 


KENNY-CAFFEY SYNDROME, 
RECESSIVE FORM 


KIFIB 


GDB: 128645 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, A; CMT2A 
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LAMB3 


GDB:251820 


LAMININ, BETA 3; LAMB3 


LAMC2 


GDB:136225 


LAMININ, GAMMA 2; LAMC2 
EPIDERMOLYSIS BULLOSA LETALIS 


LGMDIB 


GDB:231606 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE IB; LGMDIB 


LMNA 


GDB:132146 


LAMIN A/C; LMNA LIPODYSTROPHY, 
FAMILL\L PARTIAL, DUNNIGAN TYPE; 
LDFl 


LOR 


GDB: 132049 


LORICRIN; LOR 


MCKDl 


GDB:9859381 


POLYCYSTIC KIDNEYS, MEDULLARY 
1 I rt 


MCLl 


GDB:139137 


MYELOID CELL LEUKEMIA 1; MCLl 


MPZ 


GDB:125266 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS MYELIN PROTEIN 
ZERO; MPZ 


MTHFR 


GDB:370882 


5,10-@METHYLENETETRAHYDROFOL 
ATE REDUCTASE; MTHFR 


MTR 


GDB: 1 19440 


METHYLTETRAHYDROFOLATE:L-HO 
MOCYSTEINE 

S-METHYLTRANSFERASE; MTR 


MUTYH 


GDB:9315115 


ADENOMATOUS POLYPOSIS OF THE 
COT ON- APC 
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MYCX: 


GDB:5584221 


GLAUCOMA 1, OPEN ANGLE; GLCIA 
MYOCILIN; MYOC 


NB 


GDB:9958705 


NEUROBLASTOMA; NB 


NCF2 


GDB: 120223 


GRANULOMATOUS DISEASE, 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSmVE FORM 


NEMl 


GDB:127387 


NEMALINE MYOPATHY 1, 
AUTOSOMAL DOMINANT; NEMl 


NPHS2 


GDB:9955617 


ARRHYTHMOGENIC RIGHT 
VENTRICULAR DYSPLASL\, 
FAMILIAL, 2; ARVD2 


NPPA 


GDB: 118727 


NATRIURETIC PEPTIDE PRECURSOR 
A- NPPA 


NRAS 


GDB: 119457 


ONCOGENE NRAS; NRAS; NRASl 


NTRKl 


GDB: 127897 


ONCOGENE TRK NEUROTROPHIC 

1 YKUoUNli JSJiN/votl, rvUV-^JLlr^ 1 V-ix\., 1 lire* 

1 ; NTRKl NEUROPATHY, 
CONGENITAL SENSORY, WITH 
ANHIDROSIS 


OPTA2 


GDB:9955577 


OSTEOPETROSIS, AUTOSOMAL 
DOMINANT, TYPE II; OPA2 


PBXl 


GDB: 125351 


PRE-B-CELL LEUKEMIA 
TRANSCRIPTION FACTOR- 1; PBXl 


PCHC 


GDB:9955586 


PHEOCHROMOCYTOMA 


PGD 


GDB: 119486 


6-@PHOSPHOGLUCONATE 
DEHYDROGENASE, ERYTHROCYTE 
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PHA2A 


GDB:9955628 


PSEUDOHYPOALDOSTERONISM, TYPE 
n;PHA2 


PHGDH 


GDB:9958261 


3-@PHOSPHOGLYCERATE 
DEHYDROGENASE DEFICIENCY 


PKLR 


GDB: 120294 


PYRUVATE KINASE DEHCIENCY OF 

FP VTTTR CiCWP 
ct\. I 1 ntvv^W' I 1 c> 


PKPl 


GDB:4249598 


PLAKOPHILIN 1; PKPl 


PLA2G2A 


GDB: 120296 


PHOSPHOLIPASE A2, GROUP IIA; 
PLA2G2A 


PLOD 


GDB: 127821 


PROCOLLAGEN-LYSINE, 
2-OXOGLUTARATE 5-DIOXYGENASE; 
PLOD EHLERS-DANLOS SYNDROME, 
TYPE VI; E-D VI; EDS VI 


PPOX 


GDB: 118852 


PROTOPORPHYRINOGEN OXIDASE; 
PPOX 


PPT 


GDB: 125227 


CEROID-LIPOFUSCINOSIS, NEURONAL 
1, INFANTILE; CLNl 
PALMITOYL-PROTEDSf 
THIOESTERASE; PPT 


PRCC 


GDB:3888215 


PAPILLARY RENAL CELL 
CARCINOMA; PRCC 


PRG4 


GDB:9955719 


ARTHROPATHY-CAMPTODACTYLY 
SYNDROMF 








PSEN2 


GDB:633044 


ALZHEIMER DISEASE, FAMILL\L, 
TYPE 4; AD4 
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PTOSl 


GDB:6279920 


PTOSIS, HEREDITARY CONGENITAL 1; 


JvCiN 




"RFNnsT- REN 


RFX5 


GDB:6288464 


REGULATORY FACTOR 5; RFX5 


RHD 


GDB:119551 


RHESUS BLOOD GROUP, D ANTIGEN; 

xvxlJL^ 


RMDl 


GDB:448902 


Rff FLING MUSCLE DISEASE- 1; RMDl 


RPE65 


GDB:226519 


RETINAL FlOMcJN 1 
EPITHELIUM-SPECIFIC PROTEIN, 
65-KD; RPE65 AMAUROSIS 
CONGENITA OF LEBER H 


SCCD 


GDB:9955558 


CORNEAL DYSTROPHY, 


SERPINCl 


GDB:1 19024 


ANTITHROMBIN HI DEFICIENCY 


SJSl 


GDB:1381631 


MYOTONIC MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND OCULAR 
AND FACIAL 


SLC19A2 


GDB:9837779 


THIAMINE-RESPONSIVE 
MEGALOBLASTIC ANEML\ 
SYNDROME 


SLC2A1 


GDB: 120627 


SOLUTE CARRIER FAMILY 2, MEMBER 
1; SLC2A1 






FT T TPTnCYTO"?!*? RHF<?T I<s-TrMT TNTCFFI 
TYPE HEREDITARY HEMOLYTIC 
SPECTRIN, ALPHA, ERYTHROCYTIC 1; 
SPTAl 
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TALI 


GDB:120759 


T-CELL ACUTE LYMPHOCYTIC 
LEUKEMIA 1; TALI 


TNFSF6 


GDB:422178 


APOPTOSIS ANTIGEN LIGAND 1; 
APT1T Gl 


TNNT2 


GDB:221879 


TROPONIN-T2, CARDIAC; TNNT2 


TPM3 


GDB: 127872 


ONCOGENE TRK TROPOMYOSIN 3; 
TPM3 


TSHB 


GDB: 120467 


THYROID-STIMULATING HORMONE, 
BETA CHAIN; TSHB 


UMPK 


GDB: 120481 


URIDINE MONOPHOSPHATE KINASE; 

UlYLrJv 


IIOV 


nriTi* 1 97<C'^o 


f TP ATT7 OYTTI TTOY 


xJlsXJiJ 


nVXH' 1 1 Q#^'>Q 






I^TMl ■ 1 OOAd 
KJXJO . 1 ZU^O J 


TTQUTTO CVTMTll? r^livf "C TVPTJ TT- T TCW^ 

Uorlcxs. o I IN JJlv^JVLc, 1 I fii H, xJ^rlZ 


VMGLOM 


GDB:9958134 


GLOMUS TUMORS, MULTIPLE 


vws 


GDB: 120532 


CLEFT LIP AND/OR PALATE WITH 
MUCOUS CYSTS OF LOWER LIP 


WS2B 


GDB:407579 


WAARDENBURG SYNDROME, TYPE 
2B; WS2B 



Table 4: Genes, Locations and Genetic Disorders on Chromosome 2 



Gene 


GDB Accession ID 


Location 


OMIM Link 
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ABCBll 


GDB:9864786 


2q24-2q24 
2q24.3-2q24.3 


CHOLESTASIS, PROGRESSIVE 
FAMILIAL INTRAHEPATIC 2; 
PFIC2 




nnR- 1 0450298 


2D21-2t521 


PHYTOSTEROLEMIA 


ABCG8 


GDB: 10450300 


2p21-2p21 


PHYTOSTEROLEMIA 


ACADL 


GDB: 118745 


2q34-2q35 


ACYL-CoA DEHYDROGENASE, 
LONG-CHAIN, DEHCIENCY OF 


ACPI 


GDB: 11 8962 


2p25-2p25 


PHOSPHATASE, ACID, OF 
FT^YTHROCYTE- ACPI 


AGXT 


GDB:127113 


2q37.3-2q37.3 


OXALOSIS I 


AHHR 


GDB: 118984 


2pter-2q31 


CYTOCHROME P450, 
SUBFAMILY I, POLYPEPTIDE 1; 
CYPlAl 


ALMSl 


GDB:9865539 


2pl3-2pl2 
2pl4-2pl3 
2pl3.1-2pl3.1 


ALSTROM SYNDROME 


ALPP 


GDB: 119672 


2q37.1-2q37.1 


ALKALINE PHOSPHATASE, 
PLACENTAL; ALPP 


ALS2 


GDB:135696 


2q33-2q35 


AMYOTROPHIC LATERAL 
SCLEROSIS 2, JUVENILE; ALS2 


APOB 


GDB:1 19686 


2p24-2p23 
2p24-2p24 


APOLIPOPROTEIN B; APOB 


BDE 


GDB:9955730 


2q37-2q37 


BRACHYDACTYLY, TYPE E; 
BDE 


BDMR 


GDB:533064 


2q37-2q37 


BRACHYDACTYLY-MENTAL 
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1 ir 


RETARDATION SYNDROME; 
BDMR 


BJS 


GDB:9955717 


2q34-2q36 


TORTI AND NERVE DEAFNESS 




D . ^ /I O O /f 1 


2q33-2q34 


PT TT Ayf OM A P V MVPFPTPTM^TOM 
IT KJ LjIVI\JiS /VJtV. I n I X Hlv 1 CiN o 1\-/1N , 

PRIMARY; PPHl BONE 
MORPHOGENETIC RECEPTOR 
TYPE II; BMPR2 


CHRNAl 


GDB: 120586 


2q24-2q32 


CHOLINERGIC RECEPTOR, 
NICOTINIC, ALPHA 
POLYPEPTIDE 1; CHRNAl 


CMCWTD 


GDB: 11498919 


2p22.3-2p2l 


FAMILIAL CHRONIC 
MUCOCUTANEOUS, 
DOMINANT TYPE 


CNGA3 


01-iB:4343yo 




CYCLIC NUCLEOTIDE GATED 
CHAlvrNEL, OLFACTORY, 3; 
CNG3 






2q3i-zq3z.3 
2q32.2-2q32,2 


i^Ul^LAvjClN, 1 I rt 111, CUi-jA 1 
EHLERS-DANLOS SYNDROME, 
TYPE IV, AUTOSOMAL 
DOMINANT 


COL4A3 


GDB:128351 


2q36-2q37 


COLLAGEN, TYPE IV, ALPHA-3 
CHAIN; COL4A3 


COL4A4 


GDB:132673 


2q35-2q37 


COLLAGEN, TYPE IV, ALPHA-4 
CHAIN; COL4A4 


POT 


ODB-l 1Q066 


2a57 3-2fl37 3 


COLLAGEN TYPE VI ALPHA-3 
CHAIN; COL6A3 MYOPATHY, 
BENIGN CONGENITAL, WITH 
CONTRACTURES 
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CPSl 


GDB: 119799 


2q33-2q36 
2q34-2q35 
2q35-2q35 


HYPERAMMONEMIA DUE TO 
CARBAMOYLPHOSPHATE 
SYNTHETASE I DEFICIENCY 


CRYGA 


GDB: 119076 


2q33-2q35 


CRYSTALLBSr, GAMMA A; 
CRYGA 


CRYGEPl 


GDB: 119808 


2q33-2q35 


CRYSTALLIN, GAMMA A; 
CRYGA 


CYPIBI 


GDB:353515 


2p21-2p21 
2p22-2p21 
2pter-2qter 


GLAUCOMA 3, PRIMARY 
INFANTILE, A; GLC3A 
C Y 1 Ul^rlKvJMJi r'*j\), 
SUBFAMILY I 
(DIOXIN-INDUCIBLE), 
POLYPEPTIDE 1; CYPIBI 


CYP27A1 


GDB:128129 


2q33-2qter 


CEREBROTENDINOUS 
XANTHOMATOSIS 


DBI 


GDB: 119837 


2ql2-2q21 


DIAZEPAM BINDING 
TMRrRTTOR- DBI 


DES 


GDB: 119841 


2q35-2q35 


DESMDSF; DES 


DYSF 


GDB:340831 


2p-2p 

2pl3-2pl3 

2pter-2pl2 


MUSCULAR DYSTROPHY, 

T nvvTQ ^^T"D"riT T? 'TVT>T7 
J-yJJVlo-Oi-KJL/t^Ji, 1 irij 

LGMD2B MUSCULAR 
DYSTROPHY, LATE-ONSET 
DISTAL 


EDAR 


GDB:9837372 


2qll-2ql3 


DYSPLASIA, HYPOHIDROTIC 
ECTODERMAL DYSPLASIA, 
ANHIDROTIC 


EFEMPl 


GDB: 12201 11 


2pl6-2pl6 


DOYNE HONEYCOMB 
DEGENERATION OF RETINA 
FDBRILLIN-LIKE; FBNL 
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EIF2AK3 


GDB:995o743 


2pl2*2pl2 


EFlPrlYbtAL DYSPLASIA, 
MULTIPLE, WITH 
EARLY-ONSET DIABETES 
MELLITUS 


ERCC3 


GDB:1 19881 


2q21-2q21 


bXClolUN-KtrAUK, 

COMPLEMENTING DEFECTIVE, 
IN CHINESE HAMSTER, 3; 
ERCC3 


FSHR 


GDB:127510 


2p21-2pl6 


FOLLICLE-STIMULATING 
HORMONE RECEPTOR; FSHR 
GONADAL DYSGENESIS, XX 
TYPE 


GADl 


GDB: 119244 


2q31-2q31 


PYRIDOXINE DEPENDENCY 
WITH SEIZURES 


GINGF 


GDB:9848875 


2p21-2p21 


GINGIVAL SON OF SEVENLESS 
(DROSOPHILA) HOMOLOG 1; 
SOSl 


GLCIB 


GDB: 1297553 


2ql-2ql3 


GLAUCOMA 1, OPEN ANGLE. B; 
GLCIB 


GPD2 


GDB:354558 


2q24.1-2q24.1 


GLYCEROL-3-PHOSPJL\TE 


GYPC 


GDB: 120027 


2ql4-2q21 


BLOOD GROUP-GERBICH; Ge 


T T A TNT T A 

HADHA 


GDB:43402o 


2p23-2p23 


HYDROXY ACYL-CoA 
DEHYDROGENASE/3-KETOACY 
L-CoA THIOLASE/ENOYL-CoA 
HYDRATASE, 




vji-'o . J*Kf y J J 


zpz^-zpzj 


ri I i^rvwA. 1 I L»-i^o/\ 
DEHYDROGENASE/3-KETOACY 
L-CoA THIOLASE/ENOYL-CoA 
HYDRATASE, 
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HOXD13 


GDB: 127225 


2q31-2q31 


HOMEO BOX-D13; HOXD13 
SYNDACTYLY, TYPE II 


HPE2 


GDB: 136066 


2p21-2p21 


MIDLINE CLEFT SYNDROME 


IGKC 


GDB: 120088 


2pl2-2pl2 
2pll.2-2pll.2 


IMMUNOGLOBULIN KAPPA 
CONSTANT REGION; IGKC 


Lrirl 


KjUd.^ 1 IZUJ 


2q35-2q35 
2pter-2qter 


RR APTTVDACTYT Y TYPEAl" 
BDAl INDL\N HEDGEHOG, 
DROSOPHILA, HOMOLOG OF; 
IHH 


IRSl 


GDB: 133974 


2q36-2q36 


INSULIN RECEPTOR 
SUBSTRATE 1; lElSl 


1 1 VjrVO 


fiDR- 128027 


2Dter-2ater 


INTEGRIN, ALPHA-6; ITGA6 






2o23 3-2d23.2 


FRUCTOSURIA 


KYNU 


GDB:9957925 


2q22.2-2q23.3 




LCT 


GDB: 120140 


2q21-2q21 


DISACCHARDDE INTOLERANCE 

n 


LHCGR 


GDB: 125260 


2p21-2p21 


LUTEINIZING 

HORMONE/CHORIOGONADOTR 
OPDSr RECEPTOR; LHCGR 


LSFC 


GDB:9956219 


2-2 2pl6-2pl6 


CYTOCHROME c OXIDASE 
DEFICIENCY, 
FRENCH-CANADL\N TYPE 


MSH2 


GDB:203983 


2pl6-2pl6 
2p22-2p21 


COLON CANCER, FAMILL^L, 
NONPOLYPOSIS TYPE 1; FCCl 


MSH6 


GDB:632803 


2pl6-2pl6 


G/T MISMATCH-BINDING 
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PROTEIN; GTBP 


NEB 


GDB: 120224 


2q24.1-2q24.2 


NEBULIN; NEB NEMALINE 
MYOPATHY 2, AUTOSOMAL 
RECESSIVE; NEM2 


NMTC 


GDB: 11498336 


2q21-2q21 


THYROID CARCINOMA, 
PAPILLARY 


NPHPl 


GDB: 128050 


2ql3-2ql3 


NEPHRONOPHTHISIS, 
FAMILIAL JUVENILE 1 ; NPHPl 


PAFAHIPI 


GDB:435099 


2pll.2-2pll.2 


PLATELET-ACTIVATING 
FACTOR ACETYLHYDROLASE, 
GAMMA SUBUNn 


PAX3 


GDB: 120495 


2q36-2q36 
2q35-2q35 


KLEIN- WAARDENBURG 
SYNDROME WAARDENBURG 
SYNDROME; WSl 


PAX8 


GDB: 136447 


2ql2-2ql4 


PAIRED BOX HOMEOTIC GENE 
8;PAX8 


PMSl 


GDB:386403 


2q31-2q33 


POSTMEIOTIC SEGREGATION 
INCREASED (S. CEREVISIAE)-1 ; 
PMSl 


PNKD 


GDB:5583973 


2q33-2q35 


CHOREOATHETOSIS, FAMILIAL 
PAROXYSMAL; FPDl 


PPHl 


GDB:1381541 


2q31-2q32 
2q33-2q33 


PULMONARY HYPERTENSION, 
PRIMARY; PPHl 


PROC 


GDB:120317 


2ql3-2q21 
2ql3-2ql4 


PROTEIN C DEHCIENC Y, 
CONGENITAL THROMBOTIC 
DISEASE DUE TO 


REGIA 


GDB: 132455 


2pl2-2pl2 


REGENERATING 
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[ ^ 


ISLET-DERIVED 1 -ALPHA; 
REGIA 


SAG 


GDB: 120365 


2q37.1-2q37.1 


S-ANTIGEN; SAG 


SFTPB 


GDB: 120374 


2pl2-2pll.2 


SURF ACT ANT-ASSOCL\TED 
PROTEIN, PULMONARY-3; 
SFrP3 


SLCllAl 


GDB:371444 


2q35-2q35 


CIRRHOSIS, PRIMARY; PBC 

MATT TR AT 

RESISTANCE-ASSOCIATED 
MACROPHAGE PROTEIN 1; 
NRAMPl 


SLC3A1 


GDB:202968 


2pl6.3-2pl6.3 
2p21-2p21 


SOLUTE CARRIER FAMILY 3, 
MEMBER 1;SLC3A1 
CYSTINURIA; CSNU 


SOSl 


GDB:230004 


2p22-2p21 


GINGIVAL SON OF SEVENLESS 
(DROSOPHILA) HOMOLOG 1; 
SOSl 


SPG4 


GDB:230127 


2p24-2p21 


SPASTIC PARAPLEGIA-4, 
AUTOSOMAL DOMINANT; SPG4 


SRD5A2 


GDB: 127343 


2p23-2p23 


PSEUDOVAGINAL 
PERINEOSCROTAL 
HYPOSPADIAS; PPSH 


TCL4 


GDB: 136378 


2q34-2q34 


T-CELL 

LEUKEMIA/LYMPHOMA-4; 
TCL4 


TGFA 


GDB: 120435 


2pl3-2pl3 


TRANSFORMING GROWTH 
FACTOR, ALPHA; TGFA 


TMD 


GDB:9837196 


2q31-2q31 


TIBL\L MUSCULAR 
DYSTROPHY. TARDIVE 
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TPO 


GDB: 120446 


2p25-2p25 
2p25-2p24 


THYROID HORMONOGENESIS, 
GENETIC DEFECT IN. IIA 


UGTl 


GDB: 120007 


2q37-2q37 


UDP 

GLUCURONOSYLTRANSFERAS 
E 1 FAMILY. Al ; UGTl Al 


UV24 


GDB:9955737 


2pter-2qter 


UV-DAMAGE, EXCISION 
REPAIR OF, UV-24 


WSS 


GDB:9955707 


2q32-2q32 


WRINKLY SKIN SYNDROME; 

W 


XDH 


GDB:266386 


2p23-2p22 


XANTHINUIUA 


ZAP70 


GDB:433738 


2qll-2ql3 
2ql2-2ql2 


SYK-RELATED TYROSINE 
KINASE; SRK 


ZFHXIB 


GDB:9958310 


2q22-2q22 


DISEASE, MICROCEPHALY, 
AND IRIS COLOBOMA 



Table 5: Genes, Locations and Genetic Disorders on Chromosome 3 



Gene 


GDB Accession ID 


Location 


OMIM Linic 


ACAAl 


GDB: 119643 


3p23-3p22 


PEROXISOMAL 
3-OXOACYL-COENZYME A 
THIOLASE DEFICIENCY 


AGTRl 


GDB: 132359 


3q21-3q25 


ANGIOTENSIN 11 RECEPTOR, 
VASCULAR TYPE 1; AT2R1 


AHSG 


GDB: 118985 


3q27-3q27 


ALPHA-2-HS-GLYCOPROTEIN; 
AHSG 


AMT 


GDB: 132 138 


3p2l.3-3p21.2 
3p21.2-3p2l.l 


HYPERGLYCINEMIA, ISOLATED 
NONKETOTIC, TYPE II; NKH2 
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ARP 


GDB:9959049 


1 ^ 

3p21.1-3p21.1 


ARGININE-RICH PROTEIN 


BBSS 


GDB:376501 


3p-3p 

3pl2.3-3qll.l 


BARDET-BIEDL SYNDROME, 
TYPE 3; BBS3 


BCHE 


GDB: 120558 


3q26.I-3q26.2 


BUTYRYLCHOLINESTERASE; 
BCHE 


BCPM 


GDB:433809 


3q21-3q21 


BENIGN CHRONIC PEMPHIGUS; 
BCPM 


BTD 


GDB:309078 


3p25-3p25 


BIOTINIDASE; BTD 


CASK 


GDB:134196 


3q21-3q24 


HYPOCALCIURIC 
HYPERCALCEMIA, FAMILIAL; 
HHCl 


CCR2 


GDB:337364 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 2; 
CMKBR2 


CCR5 


GDB:1230510 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 5; 
CMKBR5 


CDLl 


GDB: 136344 


3q26.3-3q26.3 


DE LANGE SYNDROME; CDL 


CMT2B 


GDB:604021 


3ql3-3q22 


CHARCOT-MARIE-TOOTH 
DISEASE, NEURONAL TYPE, B; 
CMT2B 


COL7A1 


GDB: 128750 


3p21-3p21 
3p21.3-3p21.3 


COLLAGEN. TYPE VU, ALPHA-1; 
COL7A1 


CP 


GDB: 119069 


3q23-3q25 
3q21-3q24 


CERULOPLASMIN; CP 


CRV 


GDB:1 1498333 


3p21.3-3p21.1 


VASCULOPATHY, RETINAL, 
WITH CEREBRAL 
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LEUKODYSTROPHY 


CTNNBl 


GDB:141922 


3p22-3p22 
3p21.3-3p21.3 


CATENIN, BETA 1; CTNNBl 


DEM 


GDB:681157 


3pl2-3qll 


DEMENTIA, FAMILIAL 
NONSPECIFIC; DEM 


ETMl 


GDB:9732523 


3ql3-3ql3 


TREMOR, HEREDITARY 
ESSENTL\L 1;ETM1 


FANCD2 


GDB:698345 


3p25.3-3p25.3 
3pter-3p24.2 


FANCONI PANCYTOPENIA, 
COMPLEMENTATION GROUP D 


FIH 


GDB:9955790 


3ql3-3ql3 


HYPOPARATHYROIDISM, 
FAMILIAL ISOLATED; FIH 


FOXL2 


GDB: 129025 


3q23-3q23 
3q22-3q23 


BLEPHAROPHIMOSIS, 
EPICANTHUS INVERSUS, AND 
PTOSIS; BPES 


GBEl 


GDB:138442 


3pl2-3pl2 


GLYCOGEN STORAGE DISEASE 

rv 


GLBl 


GDB: 119987 


3p22-3p21.33 

3p21.33-3p21. 

33 


GANGLIOSIDOSIS, 
GENERALIZED GMl, TYPE I 


GLCIC 


GDB:3801941 


3q21-3q24 


GLAUCOMA 1, OPEN ANGLE, C; 

GLCIC 


GNAI2 


GDB: 120516 


3p21.3-3p21.2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-INHIBITING, 
POLYPEPTIDE-2; 


GNATl 


GDB: 119277 


3p21.3-3p21.2 


GUANINE 

NUCLEOTIDE-BINDING 
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PROTEIN, 

ALPHA-TRANSDUCING, 
POLYPEPTIDE 


GP9 


GDB: 126370 


3pter-3qter 


PLATELET GLYCOPROTEIN IX; 
GP9 


GPXl 


GDB: 119282 


3qll-3ql2 


GLUTATHIONE PEROXIDASE; 


HGD 


GDB:203935 


3q21-3q23 


ALKAPTONURIA; AKU 


HRG 


GDB: 120055 


3q27-3q27 


HISTIDINE-RICH 
GLYCOPROTEIN; HRG; HRGP 


ITIHl 


GDB: 120107 


3p21.2-3p21.1 


INTER-ALPHA-TRYPSIN 
INHIBITOR, HEAVY CHAIN- 1; 
ITIHl; L\TIH; ITIH 


KNG 


GDB: 125256 


3q27-3q27 


FLAUJEAC FACTOR 
DEFICIENCY 


LPP 


GDB: 139 1795 


3q27-3q28 


LIM DOMAIN-CONTAINING 
PREFERRED TRANSLOCATION 
PARTNER IN LIPOMA; LPP 


LRSl 


GDB:682448 


3p21.1-3pl4.1 


LARSEN SYNDROME, 
AUTOSOMAL DOMINANT; LRSl 


MCCCl 


GDB: 135989 


3q27-3q27 
3q25-3q27 


BETA-METHYLCROTONYLGLY 
CINURIAI 


MDSl 


GDB:250411 


3q26-3q26 


MYELODYSPLASIA SYNDROME 
1;MDS1 


MHS4 


GDB:574245 


3ql3.1-3ql3.1 


HYPERTHERML\ 
SUSCEPTIBILITY-4; MHS4 
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MUF 


GDB:2 14776 


3pl4.1-3pl2 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MITF WAARDENBURG 
SYNDROME, TYPE II; WS2 


MLHl 


GDB:249617 


3p23-3p22 
3p21.3-3p21.3 


COLON CANCER, FAMILIAL, 
NONPOLYPOSIS TYPE 2; FCC2 


MYL3 


GDB:120218 


3p21.3-3p21.2 


MYOSIN, LIGHT CHAIN, 
ALKALI, VENTRICULAR AND 
oivbLrili Ai^ oJLV-IW, MYLJ 


MYMY 


CjDB: 1 1 5006 1 0 


Jpzo-jp24.2 




OPAl 


GDB: 118848 


3q28-3q29 


OPTIC ATROPHY 1; OPAl 


PBXPl 


GDB: 125352 


3q22-3q23 


PRE-B^CELL LEUKEMIA 
TRANSCRIPTION FACTOR- 1; 


PCCB 


GDB: 119474 


3q21-3q22 


GLYCINEML\, KETOTIC, H 


POUIFI 


GDB: 129070 


3pll-3pll 


POU DOMAIN, CLASS 1, 
TRANSCRIPTION FACTOR 1; 
POUIFI 


PPARG 


GDB: 12238 10 


3p25-3p25 


CANCER OF COLON 
PEROXISOME PROLIFERATOR 
ACTIVATED RECEPTOR, 
GAMMA; PPARG 


PROSl 


GDB: 120721 


3pll-3qll 
3pll.l-3qll.2 


PROTEIN S, ALPHA; PROSl 


T>TIJ1> 1 

r irlKl 






Mil 1 Ar M 1 atA-L 

CHONDRODYSPLASIA, MURK 
JANSEN TYPE PARATHYROID 
HORMONE RECEPTOR 1; PTHRl 
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RCAl 


GDB:230233 


3pl4.2-3pl4.2 


RENAL CARCINOMA, 
FAMILIAL, ASSOCIATED 1; 
RCAl 


RHO 


GDB: 120347 


3q21.3-3q24 


RHODOPSO^; RHO 


SCA7 


GDB:454471 


3p21.1-3pl2 


SPINOCEREBELLAR ATAXIA 7; 
SCA7 


SCLCl 


GDB:9955750 


3p23-3p21 


SMALL-CELL CANCER OF THE 
LUNG; SCCL 


SCN5A 


GDB: 132152 


3p21-3p21 


SODIUM CHANNEL, 
VOLTAGE-GATED, TYPE V, 
ALPHA POLYPEPTIDE; SCN5A 


SI 


GDB: 120377 


3q25.2-3q26.2 


DISACCHARIDE INTOLERANCE 
I 


SLC25A2 
0 


GDB:6503297 


3p21.31-3p21. 
31 


CARNITINE-ACYLCARNITINE 
TRANSLOCASE; CACT 


SLC2A2 


GDB: 11 9995 


3q2o.2-3q27 
3q26.1-3q26.3 


Qr\J TTT'ir r^ATJOFCD T7A"\>fTT V O 

oU-LU 1 11 UAi<-KXcK rAiVlULrX Z, 

MEMBER 2; SLC2A2 
FANCONI-BICKEL SYNDROME; 
FBS 


TF 


GDB: 120432 


3q21-3q21 


TRANSFERRIN; TF 


TGFBR2 


GDB:224909 


3p22-3p22 
3pter-3p24.2 


TRANSFORMING GROWTH 
FACTOR-BETA RECEPTOR, 
TYPE IT- TGFBR2 


THPO 


GDB:374007 


3q26.3-3q27 


THROMBOPOIETIN; THPO 


THRB 


GDB: 120731 


3p24.1-3p22 
3p24.3-3p24.3 


THYROID HORMONE 
RECEPTOR, BETA; THRB 
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TKT 


GDB: 132402 


3pl4.3-3pl4.3 


WERNICKE-KORSAKOFF 
SYNDROME 


TM4SF1 


GDB:250815 


3q21-3q25 


TUMOR-ASSOCIATED ANTIGEN 
L6; TAAL6 


TRH 


GDB: 128072 


3pter-3qter 


THYROTROPIN-RELEASING 


UMPS 


GDB: 120482 


3ql3-3ql3 


OROTICACIDURIA I 


UQCRCl 


GDB: 141 850 


3p21.3-3p21.2 
3p21.3-3p21.3 


UBIQUINOL-CYTOCHROME c 
REDUCTASE CORE PROTEIN I; 
UQCRCl 


USH3A 


GDB:392645 


3q21-3q25 


USHER SYNDROME, TYPE HI; 
USH3 


VHL 


GDB: 120488 


3p26-3p25 


VON HIPPEL-LINDAU 
SYNDROME; VHL 


WS2A 


GDB:1280S3 


3pl4.2-3pl3 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MTTF WAARDENBURG 
SYNDROME, TYPE U; WS2 


XPC 


GDB: 134769 


3p25.1-3p25.1 


XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP C; 
XPC 


ZNF35 


GDB: 120507 


3p21-3p21 


ZINC FINGER PROTEIN-35; 
ZNF35 



Table 6: Genes, Locations and Genetic Disorders on Chromosome 4 



Gene 


GDB Accession ID 


Location 


OMIM Link 
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ADHIB 


GDB: 119651 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-2; ADH2 


ADHIC 


GDB: 119652 


4q21-4q23 

An'JO.An'yy 
Hq^^'tq^^ 


ALCOHOL 

DEHYDROGENASE-3' ADH3 


AFP 


GDB: 1 19660 


4qll-4ql3 


ALPHA-FETOPROTEIN; AFP 


AGA 


GDB: 118981 


4q23-4q35 
4q32-4q33 


ASPARTYLGLUCOSAMINURL\ 
; AGU 


Affl2 


GDB: 118751 


4qll-4ql3 
4ql3.3-4q21.2 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
ATTTOQOMAT DOMINANT* 


ALB 


GDB: 118990 


4qll-4ql3 


ALBUMIN; ALB 


ASMD 


GDB: 119705 


4q-4q 


ANTERIOR SEGMENT OCULAR 


BFHD 


GDB: 11498907 


4q34.1-4q35 


DYSPLASL\, BEUKES TYPE 


CNGAl 


GDB: 127557 


4pl4-4ql3 


CYCLIC NUCLEOTIDE GATED 
CHANNEL, PHOTORECEPTOR, 
rCifi/TP nATFD 1 • CNCGl 


CRBM 


GDB:9958132 


4pl6.3-4pl6.3 


CHERUBISM 


DCK 


GDB: 1268 10 


4ql3.3-4q21.1 


DEOXYCYTIDINE KINASE; 
DCK 


DFNA6 


GDB:636175 


4pl6.3-4pl6.3 


DEAFNESS. AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 6; DFNA6 


DSPP 


GDB:5560457 


4pter-4qter 
4q21.3-4q21.3 


DENTIN PHOSPHOPROTEIN; 
DPP DENTINOGENESIS 
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llvij'iiivjrxiUl A, UOli 


DLurZ 




4q-4q 


T^PXITrM PiVQPT AQTA TVHPP IT 
L^HIN 1 ilN U X oJrl^/VolA, 1 I X 1-f 11 


ELONG 


GDB: 11498700 


4q24-4q24 




ENAM 


GDB:9955259 


4q21-4q21 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
AUTOSOMAL DOMINANT; 
AMELOGENESIS IMPERFECTA, 
HYPOPLASTIC TYPE 


ETFDH 


GDB: 135992 


4q32-4q35 


GLUTARICACIDURIA HC; GA 

nc 


EVC 


GDB:555573 


4pl6-4pl6 


ELLIS-VAN CREVELD 
biNUROMJi; 11 VC 


Fll 


GDB: 119891 


4q35-4q35 


PTA DEFICIENCY 


FABP2 


GDB: 11 9127 


4q28-4q31 


FATTY ACID BINDING 
PROTEIN 2, INTESTINAL; 
FABP2 


FGA 


GDB:119129 


4q28-4q28 


AMYLOIDOSIS. FAMILIAL 
VISCERAL FIBRINOGEN, A 
ALPHA POLYPEPTIDE; FGA 


FOB 


GDB:1 19130 


4q28-4q28 


FIBRINOGEN, B BETA 
POLYPEPTIDE; FGB 


FGFR3 


GDB: 127526 


4pl6.3-4pl6.3 


ACHONDROPLASIA; ACH 
BLADDER CANCER 
FIBROBLAST GROWTH 
FACTOR RECEPTOR-3; FGFR3 


EGG 


GDB: 11 91 32 


4q28-4q28 


FIBRINOGEN, G GAMMA 
POLYPEPTIDE; FGG 
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FSHMDl 
A 



GDB: 119914 



4q35-4q35 



FACIOSCAPULOHUMEBIAL 
MUSCULAR DYSTROPHY 1 A; 
FSHMDl A 



GC 



GDB: 119263 



4ql2-4ql3 
4ql2-4ql2 



GROUP-SPECIFIC 
COMPONENT; GC 



GNPTA 



GDB: 119280 



4q21-4q23 



MUCOLIPIDOSIS U; ML2; ML H 



GNRHR 



GDB: 136456 



4ql3-4ql3 
4q21.2-4q21.2 



GONADOTROPIN-RELEASING 
HORMONE RECEPTOR; 
GNRHR 



GYPA 



GDB: 11 8890 



4q28-4q31 
4q28.2-4q31.1 



BLOOD GROUP-MN LOCUS; 
MN 



HCA 



GDB:9954675 



4q33-4qter 



HYPERCALCIURIA, FAMILIAL 
IDIOPATHIC 



HCL2 



GDB:1 19305 



4q28-4q31 
4q-4q 



HAIR COLOR-2; HCL2 



HD 



GDB:1 19307 



4pl6.3-4pl6.3 



HUNTINGTON DISEASE; HD 



HTN3 



GDB:125601 



4ql2-4q21 



HISTATIN-3; HTN3 



HVBS6 



GDB: 120687 



4q32-4q32 



HEPATOCELLULAR 
CARCINOMA-2; HCC2 



IDUA 



GDB: 119327 



4pl6.3-4pl6.3 



MUCOPOLYSACCHARIDOSIS 
TYPE I; MPS I 



IF 



GDB: 120077 



4q24-4q25 
4q25-4q25 



COMPLEMENT COMPONENT-3 
INACTIVATOR, DEFICIENCY 
OF 



JPD 



GDB: 1201 13 



4pter-4qter 



PERIODONTITIS. JUVENILE; 
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4ql2-4ql3 


JPD 


KIT 


GDB:120117 


4ql2-4ql2 


V-KIT HARDY-ZUCKERMAN 4 
FELINE SARCOMA VIRAL 
ONCOGENE HOMOLOG; KIT 


KLKBl 


GDB: 127575 


4q34-4q35 
4q35-4q35 


FLETCHER FACTOR 
DEFICIENCY 


LQT4 


GDB:682072 


4q25-4q27 


SYNDROME WITHOUT 

PSYCHOMOTOR 

RETARDATION 


MANBA 


GDB:125261 


4q21-4q25 


MANNOSIDOSIS, BETA; 
MANBl 


MLLT2 


GDB: 136792 


4q21-4q21 


MYELOID/LYMPHOID OR 
MIXED LINEAGE LEUKEMIA, 
TRANSLOCATED TO, 2; MLLT2 


MSXl 


GDB: 120683 


4pl6.3-4pl6.1 
4pl6.1-4pl6,l 


MSH, DROSOPHILA, HOMEO 
BOX. HOMOLOG OF, 1; MSXl 


MTP 


GDB:228961 


4q24-4q24 


MICROSOMAL TRIGLYCERIDE 
TRANSFER PROTEEM. 88 KD; 
MTP 


NR3C2 


GDB: 1201 88 


4q31-4q31 
4q31.1-4q31.1 


PSEUDOHYPOALDOSTERONIS 
M, TYPE I, AUTOSOMAL 


PBT 


GDB: 120260 


4ql2-4q21 


PIEBALD TRAIT; PBT 


PDE6B 


GDB: 125915 


4pl6.3-4pl6.3 


NIGHTBLINDNESS, 
CONGENITAL STATIONARY; 
CSNB3 PHOSPHODIESTERASE 
6B, cGMP-SPECIFIC, ROD, 
BETA; PDE6B 
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PEEl 


GDB:7016765 


4q31-4q34 
4q25-4qter 


1;PEE1 


PITX2 


GDB: 134770 


4q25-4q27 
4q25-4q26 
4q25-4q25 


nUDOGONIODYSGENESIS, 
TYPE 2; rRID2 RIEGER 
SYNDROME, TYPE 1 ; RIEG 1 

HOMEOBOX TRANSCRIPTION 
FACTOR 1; RIEGl HOMEO BOX 
2 


PKD2 


GDB:1 18851 


4q21-4q23 


POLYCYSTIC BODNEY 
DISEASE 2; PKD2 


QDPR 


GDB:120331 


4pl5.3-4pl5.3 
4pl5.31-4pl5. 
31 


PHENYLKETONURIA II 


SGCB 


GDB:702072 


4ql2-4ql2 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2E; 
LGMD2E 


SLC25A4 


GDB:1 19680 


4q35-4qJ5 


TRANSLOCATOR 1; ANTl 
PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGIA; PEO 


SNCA 


GDB:439047 


4q21.3.4q22 
4q21-4q21 


SYNUCLEIN, ALPHA; SNCA 
PARKINSON DISEASE, 
FAMILIAL, TYPE 1; PARKl 


SODS 


GDB: 125291 


4pl6.3-4q21 


SUPEROXIDE DISMUTASE, 
EXTRACELLULAR: SOD3 


STATH 


GDB: 120391 


4qll-4ql3 


STATHERIN; STATH; STR 


TAPVRl 


GDB:392646 


4pl3-4qll 


ANOMALOUS PULMONARY 
VENOUS RETURN; APVR 
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TYS 


GDB: 119624 


4q-4q 


SCLEROTYLOSIS; TYS 


WBS2 


GDB: 132426 


4q33-4q35.l 


WILLIAMS-BEUREN 
SYNDROME; WBS 


WFSl 


GDB:434294 


4p-4p 
4pl6-4pl6 


DIABETES MELLITUS AND 
INSIPIDUS WITH OPTIC 
ATROPHY AND DEAFNESS 


WHCR 


GDB: 125355 


4pl6.3-4pl6.3 


WOLF-HIRSCHHORN 
SYNDROME; WHS 



Table 7: Genes, Locations and Genetic Disorders on Chromosome 5 



Gene 


GDB Accession ID 


OMIM Link 


ADAMTS2 


GDB:9957209 


EHLERS-DANLOS SYNDROME, TYPE 
Vn, AUTOSOMAL RECESSIVE 


ADRB2 


GDB: 120541 


BETA-2-ADRENERGIC RECEPTOR; 
ADRB2 


AMCN 


GDB:9836823 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, NEUROGENIC TYPE 


AP3B1 


GDB:9955590 


HERMANSKY-PUDLAK SYNDROME; 
HPS 


APC 


GDB:1 19682 


ADENOMATOUS POLYPOSIS OF THE 
COLON; APC 


ARSB 


GDB: 119008 


MUCOPOLYSACCHARIDOSIS TYPE 
VI; MPS VI 


B4GALT7 


GDB:9957653 


SYNDROME, PROGEROID FORM 
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BHRl 


GDB:9956078 


ASTHMA 


C6 


GDB: 119045 


COMPLEMENT COMPONENT-6, 
DEFICIENCY OF 


C7 


GDB:119046 


COMPLEMENT COMPONENT-7, 
DEFICIENCY OF 


CCAL2 


GDB:5584265 


CHONDROCALCINOSIS, FAMILIAL 
ARTICULAR 


CKNl 


GDB: 128586 


COCKAYNE SYNDROME, TYPE I; 
CKNl 


CMDJ 


GDB:9595425 


CRANIOMETAPHYSEAL DYSPLASIA, 
JACKSON TYPE; CMDJ 


CRHBP 


GDB: 127438 


CORTICOTROPIN RELEASING 
HORMONE-BINDING PROTEIN; 
CRHBP 


CSFIR 


GDB: 120600 


COLONY-STIMULATING FACTOR-1 
RECEPTOR; CSFIR 


DHFR 


GDB: 119845 


DIHY'DROFOLATE REDUCTASE; 
DHFR 


DIAPHl 


GDB:9835482 


DEAFNESS. AUTOSOMAL 
DOMTMANT NONSYNDROMIC 
SENSORINEURAL, 1; DFNAl 
DL\PHANOUS, DROSOPHILA, 
HOMOLOG OF. 1 


DTR 


GDB: 11 9853 


DIPHTHERL\ TOXIN SENSITIVITY; 
DTS 


EOS 


GDB:9956083 


EOSINOPHILIA, FAMILIAL 
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ERVR 


GDB:9835857 


HYALOIDEORETINAL 
DEGENERATION OF WAGNER 


F12 


GDB: 119892 


HAGEMAN FACTOR DEFICIENCY 


FBN2 


GDB:128122 


CONTRACTURAL 

ARACHNODACTYLY, CONGENITAL; 
CCA 


GDNF 


GDB:450609 


GLIAL CELL LINE-DERIVED 
NEUROTROPfflC FACTOR; GDNF 


GHR 


GDB: 119984 


GROWTH HORMONE RECEPTOR; 
GHR 


GLRAl 


GDB: 118801 


GLYCINE RECEPTOR, ALPHA-1 
SUBUNIT; GLRAl KOK DISEASE 


GM2A 


GDB: 120000 


TAY-SACHS DISEASE, AB VARIANT 


HEXB 


GDB: 119308 


S ANDHOFF DISEASE 


HSD17B4 


GDB:385059 


1 7-@BETA-H YDROXYSTEROID 
DEHYDROGENASE IV; HSD17B4 


ITGA2 


GDB: 128031 


INTEGRIN, ALPHA-2; ITGA2 


KFS 


GDB:9958987 


VERTEBRAL FUSION 


LGMDIA 


GDB: 118832 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE lA; LGMDIA 


LOX 


GDB: 119367 


LYSYL OXIDASE; LOX 


LTC4S 


GDB-384080 


LEUKOTRIENE C4 SYNTHASE- LTC4S 
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MAN2A1 


GDB:136413 


MANNOSIDASE, ALPHA, H; MANA2 
DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE II 


MCC 


GDB:128163 


MUTATED IN COLORECTAL 
CANCERS; MCC 


MCCC2 


GDB: 135990 


n 


MSH3 


GDB:641986 


MutS, E. COLI, HOMOLOG OF, 3; 
MSH3 


MSX2 


GDB: 138766 


MSH (DROSOPHILA) HOMEO BOX 
HOMOLOG 2; MSX2 PARIETAL 
FORAMINA, SYMMETRIC; PFM 


NR3C1 


GDB:120017 


GLUCOCORTICOID RECEPTOR; GRL 


PCSKl 


GDB: 128033 


PROPROTEIN CONVERT ASE 
SUBTILISIN/KEXESf TYPE 1; PCSKl 


PDE6A 


GDB: 120265 


PHOSPHODIESTERASE 6A, 
cGMP-SPECIFIC, ROD, ALPHA; PDE6A 


PFBI 


GDB:9956096 


INTENSITY OF INFECTION IN 


RASAl 


GDB:120339 


RAS p21 PROTEIN ACTIVATOR 1; 
RASAl 


SCZDl 


GDB: 120370 


DISORDER-1; SCZDl 


SDHA 


GDB:378037 


SUCCINATE DEHYDROGENASE 
COMPLEX, SUBUNIT A, 
FLAVOPROTEIN; SDHA 


SGCD 


GDB:5886421 


S ARCOGLYCAN, DELTA; SGCD 



- 142- 



wo 2004/001010 



( 

PCT/US2003/019760 



SLC22A5 


GDB:9863277 


CARNITINE DEFICIENCY. SYSTEMIC. 
DUE TO DEFECT IN RENAL 
REABSORPTION 


SLC26A2 


GDB: 125421 


DIASTROPHIC DYSPLASIA; DTD 
EPIPHYSEAL DYSPLASIA, 
MULlirLt; McU JNtUNAl AL, 
OSSEOUS DYSPLASL\ I 
ACHONDROGENESIS, TYPE IB; 
ACGIB 


SLC6A3 


GDB: 132445 


SOLUTE CARRIER FAMILY 6, 
MEMBER 3; SLC6A3 
DEFICIT-HYPERACTIVITY 
DISORDER; ADHD 


SMI 


GDB:9834488 


SCHISTOSOMA MANSONI 
SUSCEPTIBILITY/RESISTANCE 


SMA@ 


GDB:120378 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMNl 


SMNl 


GDB:5215173 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMNl 


SMN2 


GDB:5215175 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 


SPINK5 


GDB:9956114 


NETHERTON DISEASE 


TCOFl 


GDB: 127390 


TREACHER 

COLLINS-FRANCESCHETTI 
SYNDROME 1; TCOFl 


TGFBI 


GDB:597601 


CORNEAL DYSTROPHY, GRANULAR 
TYPE CORNEAL DYSTROPHY, 
LATTICE TYPE I; CDLl 
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TRANSFORMING GROWTH FACTOR, 
BETA-INDUCED, 68 KD; TGFBI 



Table 8: Genes, Locations and Genetic Disorders on Chromosome 6 



Gene 


GDB Accession ID 


OMIM Link 


ALDH5A1 


GDB:454767 


SUCCINIC SEMIALDEHYDE 
DEHYDROGENASE, 
NAD(+)-DEPENDENT; SSADH 


ARGl 


GDB: 11 9006 


ARGININEMIA 


AS 


GDB: 135697 


ANKYLOSING SPONDYLITIS; AS 


ASSP2 


GDB: 119017 


CITRULLINEMIA 


BCKDHB 


GDB: 118759 


MAPLE SYRUP URINE DISEASE. TYPE 
IB 


BF 


GDB:1 19726 


GLYCINE-RICH 
BETA-GLYCOPROTEIN; GBG 


C2 


GDB: 119731 


COMPLEMENT COMPONENT-2, 
DEFICIENCY OF 


C4A 


GDB: 119732 


COMPLEMENT COMPONENT 4A; C4A 


CDKNIA 


GDB:266550 


CYCLIN-DEPENDENT KINASE 
INHIBITOR lA; CDKNIA 


COLlOAl 


GDB: 128635 


COLLAGEN, TYPE X, ALPHA 1 ; 
COLlOAl 


COL11A2 


GDB: 119788 


COLLAGEN, TYPE XI, ALPHA-2; 

COLl I A2 STICKLER SYNDROME, TYPE 



- 144- 



wo 2004/001010 



I 

PCT/US2003/019760 







II; STL2 DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 13; DFNA13 


CYP21A2 


GDB: 120605 


ADRENAL HYPERPLASL\., 

CONGENITAL, DUE TO 

Z 1 -rl I UKwA I i-rAoxl Uilr H^ICIN \^ I 


DYX2 


GDB:437584 


DYSLEXL^, SPECIFIC, 2; DYX2 


EJMl 


GDB: 119864 


MYOCLONIC EPILEPSY, JUVENILE; 
EJMl 


ELOVL4 


GDB: 11499609 


STARGARDT DISEASE 3; STGD3 


EPM2A 


GDB:3763331 


EPILEPSY, PROGRESSIVE MYOCLONIC 
2; brM2 


ESRl 


GDB:1 19120 


ESTROGEN RECEPTOR; ESR 


EYA4 


GDB:700062 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
10; UrNAlO 


F13A1 


GDB:120614 


FACTOR Xin, AI SUBUNTT; F13A1 


FANCE 


GDB: 1220236 


FANCONI ANEMIA, 
COMPLEMENTATION GROUP E; FACE 


GCLC 


GDB: 13291 5 


GAMMA-GLUTAMYLCYSTEINE 
SYNTHETASE DEFICIENCY, 
HEMOLYTIC ANEMIA DUE 


GJAl 


GDB:125196 


GAP JUNCTION PROTEIN. ALPHA-1, 43 


GLYSl 


GDB: 136421 


GLYCOSURIA. RENAL 
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GMPR 


GDB: 127058 


GUANINE MONOPHOSPHATE 
REDUCTASE 


GSE 


GDB:9956235 


DISEASE; CD 


HCR 


GDB:9993306 


PSORIASIS, SUSCEPTIBILITY TO 
HFEGDB:1 19309 HEMOCHROMATOSIS; 
HFE 


HLA-A 


GDB: 119310 


HLA-A HloTOCOMFAlJJtSlLll Y lYx'xl, 
HLAA HLA-DPBl GDB: 120636 LA-DP 
HISTOCOMPATIBILITY TYPE, BETA-1 
SUBUNIT 


HLA-DRA 


GDB: 120641 


HLA-DR HISTOCOMPATIBILITY TYPE; 
HLA-DRA 


HPFH 


GDB:9849006 


HETEROCELLULAR HEREDITARY 
PERSISTENCE OF FETAL 


ICSl 


GDB: 136433 


IMMOTILE CILL\ SYNDROME-1; ICSl 


IDDMl 


GDB:9953173 


DIABETES MELLITUS, 
JUVENILE-ONSET 
INSULIN-DEPENDENT; IDDM 


IFNGRl 


GDB: 120688 


INTERFERON, GAMMA, RECEPTOR-1; 
TFNGRl 


IGADl 


GDB:6929077 


SELECTIVE DEFICIENCY OF 


IGF2R 


GDB: 120083 


INSULIN-LrKE GROWTH FACTOR 2 
RECEPTOR; IGF2R 


ISCW 


GDB:9956158 


SUPPRESSION; IS 
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LAMA2 


GDB: 132362 


LAMININ, ALPHA 2; LAMA2 


LAP 


GDB:9958992 


LARYNGEAL ADDUCTOR PARALYSIS; 
T AP 








LPA 


GDB:120699 


APOLIPOPROTEIN(a); LPA 


MCDRl 


GDB:131406 


MACULAR DYSTROPHY, RETINAL, 1, 
NORTH CAROLINA TYPE; MCDRl 


MOCSl 


GDB:9862235 


MOLYBDENUM COFACTOR 
DEFICIENCY 


MUT 


GDB:120204 


METHYLMALONICACIDURIA DUE TO 
METHYLMALONIC CoA MUTASE 
DEFICIENCY 


MYB 


GDB: 119441 


V-MYB AVL\N MYELOBLASTOSIS 


NEUl 


GDB: 120230 


NEURAMINIDASE DEFICIENCY 


NKSl 


GDB:128100 


SUSCEPTIBILITY TO LYSIS BY 
ALLOREACTIVE NATURAL KILLER 


NYS2 


GDB:9848763 


NYSTAGMUS, CONGENITAL 


OAS 


GDB: 136429 


ALBINISM. OCULAR, AUTOSOMAL 
RECESSIVE; OAR 


ODDD 


GDB:6392584 


OCULODENTODIGITAL DYSPLASIA; 
ODDD 
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nnR-l 90747 


OROFACIAL CLEFT 1' OFCl 


PARK2 


GDB:6802742 


PARKINSONISM, JUVENILE 


PBCA 


GDB:9956321 


BETA CELL AGENESIS WITH 
NEONATAL DIABETES MELLITUS 


PBCRAl 


GDB:3763333 


CHORIORETINAL ATROPHY, 

PROnPF^s^TVF RrFO<~'AT • PR APR 




(jL>Jt>.13oJ4y 




PEX3 


GDB:9955507 


ZELLWEGER SYNDROME; ZS 


PEX6 


GDB:5592414 


ZELLWEGER SYNDROME; ZS 
PEROXlN-6; PEX6 


PEX7 


GDB:6 155803 


RHIZOMELIC CHONDRODYSPLASIA 
PUNCTATA; RCDP PEROXIN-7; PEX7 


PKHDl 


GDB:433910 


POLYCYSTIC KIDNEY AND HEPATIC 
DISEASE- 1; PKHDl 


PLA2G7 


GDB:9958829 


PLATELET- ACTIVATING FACTOR 
APFTYT HYDROT ASF SIIBTJMTT 


PLG 


GDB: 119498 


PLASMINOGEN; PLG 


POLH 


GDB:6963323 


PIGMENTOSUM WITH NORMAL DNA 
REPAIR RATES 


PPAC 


GDB:9956248 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID. OF 
CHILDHOOD 
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PSORSl 


GDB:6381310 


PSORIASIS, SUSCEPTIBILITY TO 


PUJO 


GDB:9956231 


MULTICYSTIC RENAL DYSPLASIA, 




vj . J J J y z y 




RDS 


GDB:118863 


RETINAL DEGENERATION, SLOW; RDS 


RHAG 


GDB: 136011 


RHESUS BLOOD GROUP-ASSOCIATED 
GLYCOPROTEIN; RHAG RH-NULL, 
REGULATOR TYPE; RHN 


RP14 


GDB:433713 


RETINITIS PIGMENTOSA-14; RP14 
TUBBY-LIKE PROTEIN 1; TULPl 


RUNX2 


GDB:392082 


CLEIDOCRANIAL DYSPLASIA; CCD 

CORE-BINDING FACTOR. RUNT 
r\/^\>r A rM a t t>u a ct tot tmtt' i • r*XiT7 a i 














SCZD3 


GDB:635974 


DISORDER-3; SCZD3 


SIASD 


GDB:433552 


SIALIC ACID STORAGE DISEASE; 
SL\SD 


SOD2 


GDB: 119597 


SUPEROXIDE DISMUTASE 2, 


O X o 


GDB -6 118456 


OVARIAN TI JMOR 


TAPl 


GDB: 132668 


TRANSPORTER 1, ABC; TAPl 
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TAP2 


GDB: 132669 


TRANSPORTER 2, ABC; TAP2 


TFAP2B 


GDB:681506 


DUCTUS ARTERIOSUS; PDA 
TRANSCRIPTION FACTOR AP-2 BETA; 
TFAP2B 


TNDM 


GDB:9956265 


DIABETES MELLITUS. TRANSIENT 
NEONATAL 


TNF 


GDB:120441 


TUMOR NECROSIS FACTOR; TNF 


TPBG 


GDB:125568 


TROPHOBLAST GLYCOPROTEIN; 
TPBG; M6P1 


TPMT 


GDB:209025 


TfflOPURINE 

S-METHYLTRANSFERASE; TPMT 


TULPl 


GDB:61 99353 


TUBBY-LKE PROTEIN 1; TULPl 


WISP3 


GDB:9957361 


ARTHROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 


Table 9: Genes. Locations and Genetic Disorders on Chromosome 7 


Gene 


GDB Accession ID 


OMIM Link 


AASS 


GDB: 11502144 


HYPERLYSINEMIA 


ABCBl 


GDB: 1207 12 


P-GLYCOPROTEDSf-l; PGYl 


ABCB4 


GDB: 12071 3 


P-GLYCOPROTEIN-3; PGY3 


ACHE 


GDB: 118746 


ACETYLCHOLINESTERASE BLOOD 
GROUP-Yt SYSTEM; YT 
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AQPl 


GDB: 129082 


AQUAPORDSf-l; AQPl BLOOD 
ripni IP— POT TOM- po 








A OXTC 


kjUd. I ly /uo 




AUTSl 


GDB:9864226 


DISORDER 


BPGM 


GDB: 119039 


DIPHOSPHOGLYCERATE MUTASE 
L)b,r U^lhNK^i Dt llKY IrlKUCY 111 


C7orf2 


GDB: 10794644 


ACHEIROPODY 


CACNA2D1 


GDB: 132010 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA-2/DELTA SUBUNIT; 
MALIGNANT HYPERTHERMIA 
SUSCEPTIBILITY-3 


CCMl 


GDB:580824 


CEREBRAL CAVERNOUS 
MAJLrUKMAIKJXNo 1, UUVll 


CD36 


GDB:138800 


CD36 ANTIGEN; CD36 


CFTR 


GDB: 120584 


CYSTIC FIBROSIS; CF DEFERENS, 
CONGENITAL BILATERAL APLASIA OF; 


CHORDOMA 


GDB: 11498328 




CLCNl 


GDB: 134688 


CHLORIDE CHANNEL 1, SKELETAL 
MUSCLE; CLCNl 


CMH6 


GDB:9956392 


CARDIOMYOPATHY. FAMILIAL 
HYPERTROPHIC, WITH 
WOLFF-PARKINSON-WHITE 
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CMT2D 


GDB:9953232 


CHARCOT-MARIE-TCX)TH DISEASE, 
NEURONAL TYPE, D 


COL1A2 




rriT I Ar;PTsi typf tat pha-2 
POLYPEPTIDE; COL1A2 OSTEOGENESIS 
IMPERFECTA TYPE I OSTEOGENESIS 
IMPERFECTA TYPE IV: OI4 






CRANIOSYNOSTOSIS TYPE 1* CRSl 


CYMD 


GDB:366594 


MACULAR EDEMA, CYSTOID 


DFNA5 


GDB:636174 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 5; 
DFNA5 


DLD 


GDB: 120608 


LIPOAMIDE DEHYDROGENASE 
DEFICIENCY, LACTIC ACIDOSIS DUE 
TO 


DYTll 


GDB:10013754 


MYOCLONUS, HEREDITARY 
ESSENTIAL 


EECl 


GDB: 136338 


ECTRODACTYLY, ECTODERMAL 
DYSPLASIA, AND CLEFT LIP/PALATE; 
EEC 


ELN 


GDB: 119107 


ELASTIN; ELN WILLIAMS-BEUREN 
SYNDROME- WBS 


TJXVl 

r^i V 1 




ETS VARIANT GENE 1 ; ETVl 


FKBP6 


GDB:9955215 


WILLIAMS-BEUREN SYNDROME; WBS 


GCK 


GDB: 127550 


DL\BETES MELLITUS, AUTOSOMAL 
DOMINANT, TYPE H GLUCOKINASE; 
GCK 
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GHRHR 


GDB: 138465 


GROWTH HORMONE-RELEASING 
HDRMONF RECEPTOR- GHRHR 


GHS 


GDB:9956363 


MICROSOMIA WITH RADIAL DEFECTS 


GLI3 


GDB: 11 9990 


PALLISTER-HALL SYNDROME; PHS 
GLI-KRUPPEL FAMILY MEMBER 3; 
GLI3 POST AXIAL rOL YUAU 1 YL Y, 
TYPE Al GREIG 
CEPHALOPOLYSYNDACTYLY 
SYNDROME; GCPS 


GPDSl 


GDB:9956410 


GLAUCOMA, PIGMENT-DISPERSION 


GUSB 


GDB: 120025 


MUCOPOLYSACCHARIDOSIS TYPE VD 


HADH 


GDB: 120033 


HYDROXYACYL-CoA 
DEHYDROGENASE/3-KETOACYL-CoA 
THIOLASE/ENOYL-CoA HYDRATASE, 


HLXB9 


GDB: 136411 


HOMEO BOX GENE HB9; HLXB9 
SACRAL AGENESIS. HEREDITARY, 
WITH PRESACRAL MASS, ANTERIOR 


HOXA13 


GDB: 120656 


HOMEO BOX A13; HOXA13 


HPFH2 


GDB: 128071 


HEREDITARY PERSISTENCE OF FETAL 
HEMOGLOBIN, HETEROCELLULAR, 








lAB 


GDB: 11498909 


ANEURYSM, INTRACRANIAL BERRY 


IMMP2L 


GDB:1 1499195 


GILLES DE LA TOURETTE SYNDROME; 
GTS 
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LONG QT SYNDROME, TYPE 2; LQT2 




GDBl 19357 


LAMININ BETA 1; LAMBl 






LEPTIN- LEP 


MET 


GDB: 120178 


MET PROTO-ONCOGENE; MET 


NCFl 


GDB: 120222 


GRANULOMATOUS DISEASE, 
CHRONIC. AUTOSOMAL 
CYTOCHROME-b-POSITIVE FORM 


NM 


GDB: 119454 


NEUTROPHIL CHEMOTACTIC 
RESPONSE; NCR 


OGDH 


GDB: 118847 


ALPHA-KETOGLUTARATE 
DEHYDROGENASE DEFICIENCY 


OPNISW 


GDB: 119032 


TRITANOPIA 


PEXl 


GDB:9787110 


ZELLWEGER SYNDROME; ZS 
PEROXIN-l;PEXl 


PGAM2 


GDB: 120280 


PHOSPHOGLYCERATE MUTASE, 
DEFICIENCY OF M SUBUNIT OF 


PMS2 


GDB:386406 


POSTMEIOTIC SEGREGATION 
INCREASED (S. CEREVISIAE)-2; PMS2 


PONl 


GDB: 120308 


PARAOXONASE 1; PONl 


PPP1R3A 


GDB: 136797 


PROTEIN PHOSPHATASE 1, 
REGULATORY (INHIBITOR) SUBUNIT 3; 
PPP1R3 
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PRSSl 


GDB:119620 


PANCREATITIS, HEREDITARY; PCTT 
PROTFASF SERINE 1- PRSSl 


PTC 


GDB: 118744 


PHENYLTHIOCARBAMIDE TASTING 


PTPN12 


GDB: 136846 


PROTEBSr-TYROSINE PHOSPHATASE, 




kjDd. /oO 




RP9 


GDB:333931 


RETINITIS PIGMENTOSA-9; RP9 


SERPINEl 


GDB: 120297 


PLASMINOGEN ACTIVATOR 
INHIBITOR, TYPE I; PAH 


SGCE 


GDB:9958714 


MYOCLONUS, HEREDITARY 
ESSENTIAL 


SHFMl 


GDB: 128 195 


SPLIT-HAND/FOOT DEFORMITY, TYPE 
I; SHFDl 


SHH 


GDB:456309 


HOLOPROSENCEPHALY, TYPE 3; HPE3 
SONIC HEDGEHOG, DROSOPHILA, 
HOMOLOG OF; SHH 


SLC26A3 


GDB: 138 165 


DOWN-REGULATED IN ADENOMA; 
DRA CHLORIDE DL\RRHEA, FAMILIAL; 
CLD 


SLC26A4 


GDB:5584511 


PENDRED SYNDROME; PDS DEAFNESS, 
NEUROSENSORY, AUTOSOMAL 


SLOS 


GDB:385950 


SMITH-LEMLI-OPITZ SYNDROME 


SMADl 


GDB:3763345 


SPINAL MUSCULAR ATROPHY, 
DISTAL, WITH UPPER LIMB 
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PREDOMINANCE; SMADl 


TBXASl 


GDB: 128744 


THROMBOXANE A SYNTHASE 1 ; 
TBXASl 


TWIST 


GDB: 135694 


ACROCEPHALOSYNDACTYLY TYPE III 
TWIST. DROSOPHILA, HOMOLOG OF; 
TWIST 


ZWSl 


GDB: 1205 11 


ZELLWEGER SYNDROME; ZS 



Table 10: Genes, Locations and Genetic Disorders on Chromosome 8 



Gene 


GDB AccessionID 


OMIM Link 


ACHM3 


GDB:9 120558 


PDMGELAPESE BLINDNESS 


ADRB3 


GDB:203869 


BETA-3-ADRENERGIC RECEPTOR; ADRB3 


ANKl 


GDB: 118737 


SPHEROCYTOSIS, HEREDITARY; HS 


CAl 


GDB: 119047 


CARBONIC ANHYDRASE I, ERYTHROCYTE, 
ELECTROPHORETIC VARIANTS OF; CAl 


CA2 


GDB: 119739 


OSTEOPETROSIS WTTH RENAL TUBULAR 
ACIDOSIS 


CCALl 


GDB:512892 


CHONDROCALCINOSIS WITH 
EARLY-ONSET OSTEOARTHRITIS; CCAL2 


CLN8 


GDB:252118 


EPILEPSY, PROGRESSIVE, WITH MENTAL 
RETARDATION; EPMR 


CMT4A 


GDB: 138755 


CHARCOT-MARIE-TOOTH NEUROPATHY 
4A; CMT4A 
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CNGB3 






L-Urll 


kjDd.ZjZIZZ 




CPP 


GDB: 119798 


CERULOPLASMIN; CP 


CRH 


GDB: 119804 


CORTICOTROPDSr-RELEASING HORMONE; 
CRH 


CYPllBl 


GDB: 120603 


ADRENAL HYPERPLASIA, CONGENITAL, 
DUE TO 1 1-@BETA-HYDROXYLASE 
DEFICIENCY 


CYP11B2 


GDB: 1205 14 


CYTOCHROME P450, SUBFAMILY XIB, 

POLYPEPTIDE 2; CYPl lB2 


DECRl 


GDB:453934 


2,4-@DIENOYL-CoA RJEDUC T AbE; DECK 


DPYS 


GDB:S88S803 


DmYDROPYRIMIDINASE; DPYS 


DURSl 


GDB:9958126 


DUANE SYNDROME 


EBSl 


GDB:1 19856 


EPIDERMOLYSIS BULLOSA SIMPLEX, OGNA 
TYPE 


A 1 


GDB:lU7yo3i5 


JUVh-NlLii ADOiiJNCri 


EGI 


GDB:128830 


EPILEPSY, GENERALIZED, IDIOPATHIC; EGI 


EXTl 


GDB: 135994 


EXOSTOSES, MULTIPLE, TYPE I; EXTl 
CHONDROSARCOMA 


EYAl 


GDB:5215167 


BRANCHIOOTORENAL DYSPLASIA EYES 
ABSENT 1;EYA1 
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FGFRl 


GDB: 119913 


ACROCEPHALOSYNDACTYLY TYPE V 
FIBROBLAST GROWTH FACTOR 
RECEPTOR-1; FGFRl 


GNRHl 


GDB: 133746 


GONADOTROPIN-RELEASING HORMONE 1; 
rjKTPHI PAMTT TAT HYPOGONAr>OTROPHIC 


GSR 






GULOP 


GDB: 128078 


SCURVY 


HR 


GDB:595499 


ALOPECIA UNIVERSALIS ATRICHIA WITH 
PAPULAR LESIONS HAIRLESS, MOUSE, 
HOMOLOG OF 


KCNQ3 


GDB:9787230 


CONVULSIONS, BENIGN FAMILIAL 
NEONATAL, TYPE 2; BFNC2 POTASSIUM 
CHANNEL, VOLTAGE-GATED, SUBFAMILY 

V^, JVLXIJVLdI^IV. J 


KFM 


ODB:2o52yi 




KWE 


GDB:93i5120 


vt:j> AT*r\x vttp" wrrxiXTni? tjt? VTWT?\yf a 
ivtiCA 1 V-/i-»y 1 lU W IJN 1 lixC Hix I 1 riIiiYL/\ 


LGCR 


GDB:120o9o 




LPL 


vjJ-/x>. 




MCrrll 




IVifTr'POPPPH AT V* MPT 


MOS 


GDB: 11 9396 


TRANSFORMATION GENE: ONCOGENE MOS 


MYC 


GDB: 120208 


TRANSFORMATION GENE: ONCOGENE 
MYC; MYC 
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XT ATI 

NAT! 




ARYT AMIDE ACETYLASE 1- AACl 


IN /VIZ 


VJL/O. lADjyxjJ 


T^ONT A7TD TNACTIVATION 




vjijD.yjyozi 1 


MTTMFriFN BREAKAGE SYNDROME 


PLAT 


GDB: 119496 


PLASMINOGEN ACTIVATOR, TISSUE; PLAT 


PLECl 


GDB:41 19073 


EPIDERMOLYSIS BULLOSA SIMPLEX AND 
LIMB-GIRDLE MUSCULAR DYSTROPHY 
PLECTIN 1; PLECl 


PRKDC 


GDB:234702 


SEVERE COMBlNbL) IMMUNUlJiir lULtiiNL^i 
DISEASE.l; SCIDl PROTEIN KINASE, 
DNA- ACTIVATED, CATALYTIC SUBUNIT; 
PRKDC 


PXMP3 


GDB:131487 


PEROXIN-2; PEX2 ZELLWEGER SYNDROME; 


RPl 




isjD 1 IIN 1 1 lo irlvJiYlniN 1 KJ^JX' 1 , jvr i 


SCZD6 


GDB:9864736 


DISORDER-2; SCZD2 


SFTPC 


GDB: 120373 


PULMONARY SURFACTANT APOPROTEIN 


SGMl 


GDB: 135350 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


SPG5A 


GDB:250332 


SPASTIC PARAPLEGIA-5A, AUTOSOMAL 
RECESSIVE; SPG5A 


STAR 


GDB:635457 


STEROIDOGENIC ACUTE REGULATORY 
PROTEIN; STAR 
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TG 


GDB: 120434 


1 HYKUCjLUoUi-liN, Ivj 


TRPSl 


GDB:594960 


TRICHORHINOPHALANGEAL SYNDROME, 
TYPE I; TRPS 1 


TTPA 


GDB:5 12364 


VITAMIN E, FAMILIAL ISOLATED 
DEFICIENCY OF; VED TOCOPHEROL 
(ALFHA) TKANorJbK rKUltlXN, 1 Ix'A 


VMDl 


GDB: 11 9631 


MACULAR DYSTROPHY, ATYPICAL 
VliELLlFORM; VMDl 


WRN 


GDB:128446 


WERNER SYNDROME; WRN 


Table 1 1 : Genes, Locations and Genetic Disorders on Chromosome 9 


Gene 


GDB AccessionID 


OMIM Link 


ABCAl 


GDB:305294 


ANALPHALIPOPROTEINEMIA ATP-BINDING 
CASSETTE!; ABCl 


ABLl 


GDB: 119640 


ABELSON MURINE LEUKEMIA VIRAL 
ONCOGENE HOMOLOG 1; ABLl 


ABO 


GDB: 11 8956 


ABO BLOOD GROUP; ABO 


ADAMTS13 


GDB:9956467 


THROMBOCYTOPENIC PURPURA 


AKl 


GDB: 119664 


ADENYLATE KINASE-1; AKl 


ALAD 


GDB: 119665 


DELTA-AMINOLEVULINATE 
DEHYDRATASE; ALAD 


ALDHlAl 


GDB:119667 


ALDEHYDE DEHYDROGENASE- 1; ALDHl 
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AT noR 


rSDR- 119669 


FRUCTOSE INTOLERANCE, HEREDITARY 


AMBP 


GDB: 120696 


PROTEIN HC; HCP 


AMCDl 


GDB:437519 


ARTHROGRYPOSIS MULTIPLEX 
rONGENITA DISTAL TYPE 1: AMCDl 


ASS 


GDB:119010 


CITRULLINEMIA 


BDMF 


GDB:9954424 


BONE DYSPLASIA WITH MEDULLARY 


BSCL 


GDB:9957720 


SEIP SYNDROME 


C5 


GDB: 119734 


COMPLEMENT COMPONENT-5, DEFICIENCY 
OF 


CDKN2A 


GDB:335362 


MELANOMA, CUTANEOUS MALIGNANT, 2; 
CMM2 CYCLIN-DEPENDENT KINASE 






PWDRFOACANTHOCYTOSIS- CHAC 






rARTILAGE-HAIR HYPOPLASIA' CHH 


CMDIB 


GDB:677147 


CARDIOMYOPATHY, DILATED IB; CMDIB 


COL5A1 


GDB:131457 


COLLAGEN, TYPE V, ALPHA- 1 
POLYPEPTIDE- COL5A1 

X V^X^ A A A^A A XX^X^y 


CRAT 


GDB:359759 


CARNITINE ACETYLTRANSFERASE; CRAT 


DBH 


GDB: U 9836 


DOPAMINE BETA-HYDROXYLASE, 
PLASMA; DBH 



- 161 - 



wo 2004/001010 PCT/US2003/019760 



DFNfBU 


GDB: 12201 80 


DEAFNESS, NEUROSENSORY, AUTOSOMAL 
RECESSIVE, 7; DFNB7 


DFNB7 


GDB:636178 


DEAFNESS, NEUROSENSORY, AUTOSOMAL 
RECESSIVE, 7; DFNB7 




GDB- 11500297 


IMMOTILE CILIA SYNDROME-1; ICSl 




GDB* 137085 


DYSAUTONOMIA FAMILIAL; DYS 


xJ 111 




DYSTONIA 1 TORSION- DYTl 


ENG 


GDB:137193 


ENDOGLIN; ENG 


EPB72 


GDB: 128993 


ERYTHROCYTE SURFACE PROTEIN BAND 
7.2; EPB72 STOMATOCYTOSIS I 


FANCC 


GDB: 132672 


FANCONI ANEML\, COMPLEMENTATION 
GROUP C* FACC 


FBPl 


GDB:141539 


FRUCTOSE-l,6-BISPHOPHATASE 1; FBPl 


FCMD 


GDB:250412 


FUKUYAMA-TYPE CONGENITAL 
MUSCULAR DYSTROPHY- FCMD 


FRDA 


GDB: 119951 


FRIEDREICH ATAXIA 1; FRDAl 


GALT 


GDB: 119971 


GALACTOSEML\ 


GLDC 


GDB: 128611 


HYPERGLYCINEMIA, ISOLATED 
NONKETOTIC, TYPE I; NKHl 


GNE 


GDB:9954891 


INCLUSION BODY MYOPATHY; IBM2 
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\JtjLYX M. 


GDB-9784210 


GENIOSPASM 1; GSMl 


GSN 


GDB:120019 


AMYLOIDOSIS V GELSOLIN; GSN 


HSD17B3 


GDB:347487 


PSEUDOHERMAPHRODITISM. MALE, WITH 
GYNECOMASTIA 


HSNl 


GDB:3853677 


NEUROPATHY, HEREDITARY SENSORY, 

TVPF 1 


IBM2 


GDB:3801447 


INCLUSION BODY MYOPATHY; IBM2 


LALL 


GDB:9954426 


LEUKEML\, ACUTE, WITH 
LYMPHOMATOUS FEATURES; LALL 


LCCS 


GDB:386141 


LETHAL CONGENITAL CONTRACTURE 






nVSITRnPHV HI JTTFRTTF TYPE 

X O X .^Vi/x JTX X y XX kJ X X X^XVl X X^ t X X X^ 


LMXIB 


GDB:9834526 


NAIL-PATELLA SYNDROME; NPSl 


MLLT3 


GDB:138172 


MYELOID/LYMPHOID OR MIXED LINEAGE 
TFIIKFMTA TRANSLOCATED TO 3- MLLT3 




fiDB-9954430 


MELKERSSON SYNDROME 


MSSE 


GDB:128019 


EPITHELIOMA, SELF-HEALING SQUAMOUS 


NOTCHl 


GDB:131400 


NOTCH, DROSOPHILA, HOMOLOG OF, 1; 
NOTCHl 


ORMl 


GDB: 120250 


OROSOMUCOID 1; ORMl 
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PAPPA 


GDB:134729 


PREGNANCY-ASSOCIATED PLASMA 
PROTEIN A; PAPPA 


PIP5K1B 


GDB:686238 


FRIEDREICH ATAXIA 1 ; FRDAl 


PTCH 


GDB: 119447 


BASAL CELL NEVUS SYNDROME; BCNS 
PATCHED, DROSOPHILA, HOMOLOG OF; 
PTCH 


PTGSl 


GDB: 128070 


PROSTAGLANDIN-ENDOPEROXIDASE 
SYNTHASE 1- PTGSl 


rvL/XN 1 


nr»R'i loss? 


RELAXIN- RLNl 


RLN2 


GDB: 119553 


RELAXIN, OVARIAN, OF PREGNANCY 


RMRP 




iVli 1 Wi^nwlN J_/IVL^\J_^ lvlN/\-x lvV^\^Iii3i31iN\j 

ENDORIBONUCLEASE, RNA COMPONENT 
OF; RMRP; CARTELAGE-HAIR HYPOPLASIA; 
CHH 


ROR2 


GDB: 1 35454 


t>T> A r^x-rvT^ A onrvT v x vt>p "R - qpiq p OtlFMOW 
JtsKAL-rl Y UJ\\^ 1 1 1^ i , 1 i re, o, DUO tsXJOlriKJ w 

SYNDROME, RECESSIVE FORM 
NEUROTROPHIC TYROSINE KINASE, 
RECEPTOR-RELATED 2; NTRKR2 


RPDl 


GDB:9954440 


RETINITIS PIGMENTOSA-DEAFNESS 
SYNDROME 1 AUTOSOMAL DOMINANT 




GDB-9835149 


SARCOSINEMIA 


TDFA 


GDB:9954420 


FACTOR, AUTOSOMAL 


TEK 


GDB:344185 


VENOUS MALFORMATIONS, MULTIPLE 
CUTANEOUS AND MUCOSAL; VMCM TEK 
TYROSE^JE KINASE, ENDOTHELIAL; TEK 
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TSCl 


GDB: 120735 


TUBEROUS SCLEROSIS- 1; TSCl 


TYRPl 


GDB: 126337 


TYROSINASE-RELATED PROTEIN 1; TYRPl 
ALBINISM III XANTHISM 


XPA 


GDB: 125363 


XERODERMA PIGMENTOSUM I 



Table 12: Genes, Locations and Genetic Disorders on Chromosomes 10 



Gene 


GDB Accession ID 


OMIM Link 


CACNB2 


GDB: 132014 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, BETA-2 
SUBUNIT; CACNB2 


COL17A1 


GDB:131396 


COLLAGEN, TYPE XVII, ALPHA-1 
POLYPEPTIDE; COL17A1 


CUBN 


GDB:636049 


MEGALOBLASTIC ANEMIA 1; MGAl 


CYP17 


GDB: 119829 


ADRENAL HYPERPLASIA, 
CONGENITAL, DUE TO 
1 7- ALPHA-HYDROXYLASE 
DEFICIENCY 


CYP2C19 


GDB: 119831 


CYTOCHROME P450, SUBFAMILY 
nC, POLYPEPTIDE 19; CYP2C19 


CYP2C9 


GDB: 13 1455 


CYTOCHROME P450, SUBFAMILY 
nC, POLYPEPTIDE 9; CYP2C9 


EGR2 


GDB: 1206 11 


EARLY GROWTH RESPONSE-2; EGR2 


EMX2 


GDB:277886 


EMPTY SPIRACLES, DROSOPHILA, 2, 
HOMOLOG OF; EMX2 
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EPT 


GDB:9786112 


EPILEPSY, PARTIAL; EPT 


ERCC6 


GDB: 119882 


EXCISION-REPAIR 
CROSS-COMPLEMENTING RODENT 
REPAIR DEFICIENCY. 
COMPLEMENTATION 


FGFR2 


GDB: 127273 


ACROCEPHALOSYNDACTYLY TYPE 
V FIBROBLAST GROWTH FACTOR 
RECEPTOR-2; FGFR2 


HKl 


GDB: 120044 


HEXOKINASE-1; HKl 


HOXll 


GDB: 119607 


HOMEO BOX-1 1; HOXl 1 


HPS 


GDB: 127359 


HERMANSKY-PUDLAK SYNDROME; 
HPS 


IL2RA 


GDB: 119345 


INTERLEUKIN-2 RECEPTOR, ALPHA; 
IL2RA 


LGIl 


GDB:9864936 


EPILEPSY, PARTIAL; EPT 


LIPA 


GDB: 1201 53 


WOLMAN DISEASE 


MATIA 


GDB: 129077 


METHIONINE 

ADENOSYLTRANSFERASE 

DEFICIENCY 


MBL2 


GDB:120167 


MANNOSE-BEMDING PROTEIN, 
SERUM; MBPl 


MKI67 


GDB: 1201 85 


PROLIFERATION-RELATED Ki-67 
ANTIGEN; MKI67 


MXIl 


GDB: 1371 82 


MAX INTERACTING PROTEIN 1 ; 
MXIl 
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OAT 


GDB: 120246 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


OATL3 


GDB:2 15803 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


PAX2 


GDB: 138771 


PAIRED BOX HOMEOTIC GENE 2; 
PAX2 


PCBD 


GDB: 138478 


r 

PTERIN-4-ALPHA-CARBINOLAMINE 
DEHYDRATASE; PCBD 
PRIMAPTERINURIA 


PEOl 


GDB:632784 


PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGIA; PEO 


PHYH 


GDB:9263423 


REFSUM DISEASE PHYTANOYL-CoA 
HYDROXYLASE; PHYH 


PNLIP 


GDB: 127916 


LIPASE, CONGENITAL ABSENCE OF 
PANCREATIC 


PSAP 


GDB: 120366 


PROSAPOSIN; PSAP 


PTEN 


GDB:6022948 


MACROCEPHALY. MULTIPLE 
LIPOMAS AND HEMANGIOMATA 
MULTIPLE HAMARTOMA 
SYNDROME; MHAM POLYPOSIS, 
JUVENILE INTESTINAL 
PHOSPHATASE AND TENSIN 
HOMOLOG; PTEN 


RBP4 


GDB: 120342 


RETINOL-BINDING PROTEIN, 
PLASMA; RBP4 


RDPA 


GDB:9954445 


REFSUM DISEASE WITH INCREASED 
PIPECOLICACIDEMIA; RDPA 
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RET 


GDB: 120346 


RET PROTO-ONCOGENE; RET 


SDFl 


GDB:433267 


STROMAL CELL-DERIVED FACTOR 
1;SDF1 


SFTPAl 


GDB: 119593 


PULMONARY SURFACTANT 
APOPROTEIN PSP-A; PSAP 


SFTPD 


GDB: 132674 


PULMONARY SURFACTANT 
APOPROTEIN PSP-D; PSP-D 


SHFM3 


GDB:386030 


SPLIT-HAND/FOOT 
MALFORMATION, TYPE 3; SHFM3 




GDB-6549924 


NEURAMINIDASE DEFICIENCY 


THC2 


GDB- 10794765 


THROMBOCYTOPENIA 




GDB" 132671 


APOPTOSIS ANTIGEN 1; APTl 


UFS 


GDB:6380714 


UROFACIAL SYNDROME; UFS 


UROS 


GDB:128112 


PORPHYRIA. CONGENITAL 
ERYTHROPOIETIC; CEP 



Table 13: Genes, Locations and Genetic Disorders on Chromosome 1 1 



Gene 


GDB Accession ID 


OMIM Link 


AA 


GDB: 568984 


ATROPHL^ AREATA; AA 


ABCC8 


GDB:591370 


SULFONYLUREA RECEPTOR; SUR 
PERSISTENT HYPERINSULINEMIC 
HYPOGLYCEMIA OF INFANCY 
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AC* ATI 




AT PHA-MFTHYXACETOArETICACroURIA 


ALX4 


GDB: 10450304 


PARIETAL FORAMINA, SYMMETRIC; PFM 


AMPD3 


GDB: 136013 


ADENOSINE MONOPHOSPHATE 


ANC 


GDB:9954484 


CANAL CARCINOMA 


APOAl 


GDB: 119684 


AMYLOEDOSIS, FAMILIAL VISCERAL 
APOLIPOPROTEIN A-I OF HIGH DENSITY 

l^ur\Jrx\\J 1 lllIN , /Vr vJ/\ 1 


ArUA4 


kjUiS, 1 iy\)\)K) 










AIM 




ATAYTA Xt7T AXinTT7P*X A QT A • AX 


BSCL2 


GDB:9963996 


SEIP SYNDROME 


BWS 


GDB: 120567 


BECKWITH- WIEDEMANN SYNDROME; 
BWS 


CALCA 


GDB: 120571 


CALCITONIN/CALCITONIN-RELATED 

POT VPFPTTT^P ATPHA-PAir'A 


CAT 


GDB: 119049 


CATALASE; CAT 


CCNDl 


GDB: 128222 


LEUKEMIA, CHRONIC LYMPHATIC; CLL 
CYCLINDl; CCNDl 


CD3E 


GDB:1 19764 


CD3E ANTIGEN, EPSILON POLYPEPTIDE; 
CD3E 
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CD3G 


GDB:119765 


T3 T-CELL ANTIGEN, GAMMA CHAIN; T3G; 
CD3G 


CD59 


GDB: 119769 


CD59 ANTIGEN PI 8-20; CD59 HUMAN 
LEUKOCYTE ANTIGEN MICH; MICl 1 


CDKNIC 


GDB:593296 


CYCLIN-DEPENDENT KINASE INHIBITOR 
IC; CDKNIC 


CLN2 


GDB: 125228 


CEROID-LIPOFUSCINOSIS, NEURONAL 2, 
LATE INFANTILE TYPE* CLN2 


CNTF 


GDB: 1259 19 


CILIARY NEUROTROPHIC FACTOR; CNTF 


/^"mri A 
CFl lA 




DEFICIENCY OF CARNITINE 
PALMITOYLTRANSFERASE CARNITINE 
PALMITOYLTRANSFERASE I, LIVER; CPTIA 


CTSC 


GDB:642234 


KERATOSIS PALMOPLANTARIS WITH 

r^lllvlVJJJ wiN 1 \JrJ\ 1 xli/\ JSJlxVf \ l ^olo 

PALMOPLANTARIS WITH 
PERIODONTOPATHL\ AND 
ONYCHOGRYPOSIS CATHEPSIN C: CTSC 




GDB-595014 


DNA DAMAGE-BINDING PROTEIN: DDBl 


DDB2 


GDB-595015 


DNA DAMAGE-BINDING PROTEIN-2; DDB2 


DHCR7 


GDB'9835302 


SMITH-LEMLI-OPITZ SYNDROME 


DLAT 


GDB- 11 8785 


CIRRHOSIS PRIMARY; PBC 


DRD4 


GDB: 127782 


DOPAMINE RECEPTOR D4; DRD4 


ECB2 


GDB:9958955 


POLYCYTHEML\, BENIGN FAMILL\L 
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ED4 


GDB:9837373 


DYSPLASIA, MARGARITA TYPE 


EVRl 


GDB: 134029 


liXUDAilVr. VllKxUKbl uNUrAlrlY, 
FAMILIAL; EVR 

EXT2GDB:344921 EXOSTOSES, MULTIPLE, 


F2 


GDB:119894 


COAGULATION FACTOR II; F2 


FSHB 


GDB: 119955 


FOLLICLE-STIMULATING HORMONE, BETA 

T>/~|T -VrOUTJ' 1 1 1 1 VTJ. UCIJO 

FLILYi'lllrlllJn,; rdrlD 


FTHl 


GDB:120617 


FERRITIN HEAVY CHAIN 1; FTHl 


GIF 


GDB: 118800 


PERNICIOUS ANEMIA, CONGENITAL, DUE 

IvJUlirilL*! VJr IJN IKllNolv^ JrAv^l vyK. 


(j^U ID 






i^O"r\i 




o 1 LiKAvjrxi Llloi:.Aoi:i IC 


xIdo 


ijrL>i5: 1 lyzy / 


JrlxiML/^jLL>DJJN--oxll A i^Liv^Uo, xIdd 


TJT>"D"D 1 


rf^TMi • 1 onni ^ 
LrJJo. 1 ZUUJD 


iliiJVl\JVjl-ClolJN — DC 1 A l^UI^Uo, rlDo 






WPAifr^riT ORTM T^PT TAT C%^^^^' TTOFI 

nuivi wo iTN — uxjL^ 1 r\. L^xjK^KjOj nou 




CXT\Vl* \ 1 Q900 


xiiiivi^vJi-fwDiiN — jlh^oii-*vJ1n L^KJK^Kj^y noci 


rll5vjl 


\j\ji>, 1 1 y juu 


T-TPAvfl^nT r\TJTM n A \A\A A A • TTOl^l 
ITJilVl wOi^L/x5 UN , OAlVliYlA A, XloOl 


HBG2 


GDB-1 19301 


HEMOGLOBIN GAMMA G- HBG2 


HMBS 


GDB:120528 


PORPHYRIA, ACUTE INTERMITTENT; AIP 
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HARTMI TP OTSORDFR 


HOMG2 


GDB:9956484 


MAGNESIUM WASTING, RENAL 


HRAS 


GDB: 120684 


BLADDER CANCER V-HA-RAS HARVEY 
RAT SARCOMA VIRAL ONCOGENE 


HVBSl 


GDB: 120069 


CANCER. HEPATOCELLULAR 


IDDM2 


GDB: 128530 


DIABETES MELLITUS, 
INSULIN-DEPENDENT, 2 DIABETES 
MELLITUS, JUVENILE-ONSET 




vjJJo.l I yoyo 




TTvTC 




TXTQT TT TM- TXTQ 
ilNoUJLlXN, UNO 


JBS 


GDB: 1201 11 


JACOBSEN SYNDROME; JBS 


KCNJll 


GDB:7009893 


POTASSIUM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 
MEMBER 1 1 ; KCNJl 1 PERSISTENT 
HYPERINSULINEMIC HYPOGLYCEMIA OF 
INFANCY 


KCNJl 


GDB:204206 


POTASSIUM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 








LDHA 


GDB: 120141 


LACTATE DEHYDROGENASE-A; LDHA 


LRP5 


GDB:9836818 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG fflGH BONE MASS 
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MENl 


GDB:120173 


MULTIPLE ENDOCRINE NEOPLASIA, TYPE 
1;MEN1 


MLL 


GDB:128819 


MYELOID/LYMPHOID OR MDCED-LINEAGE 
LEUKEMIA; MLL 


MTACRl 


GDB: 125743 


MULTIPLE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACRl 


ivx X DrK^j 


KJXJD.D / 70 i D 


A I> T^TOlVirVOP A XTXV T? A \4TT TAT 

l^/Ai\JLylvJlVl I yJr/\ i rl I , r AJVIILIAL 

HYPERTROPHIC, 4; CMH4 
MYOSIN-BINDING PROTEIN C, CARDIAC; 
MYBPC3 


MY07A 


GDB: 132543 


MYOSIN VIIA; MY07A DEAFNESS, 

XTT7T TOOCTJXTCrf^O V A T TXT^C/^Xyf A T 

INIiUKlJoJilNoLIKY, AU HJoCJMAL. 
RECESSIVE, 2; DFNB2 DEAFNESS, 
AUTOSOMAL DOMINANT NONSYNDROMIC 
SENSORINEURAL 1 1 - DFNAl 1 


NNOl 


GDB:10450513 


SIMPLE, AUTOSOMAL DOMINANT 


OPPG 


GDB:3789438 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME- OPPG 

\^ X X ^ X>^XX,V^X.TXX^ , V^X X \_I 


OPTBl 


GDB:9954474 


OSTEOPETROSIS AUTOSOMAT RFPFSSIVF 


PAX6 


GDB: 11 8997 


PAIRED BOX HOMEOTIC GENE 6- PAX6 


PC 


GDB: 11 9472 


PYRUVATE CARBOXYLASE DEFICIENCY 


PDXl 


GDB:9836634 


PYRUVATE DEHYDROGENASE COMPLEX, 
COMPONENT X 


PGL2 


GDB:511177 


PARAGANGLIOMAS, FAMILL\L 
NONCHROMAFFIN, 2; PGL2 
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PGR 


GDB:119493 


PROGESTERONE RESISTANCE 


PORC 


GDB:128610 


PORPHYRIA, CHESTER TYPE; PORC 


PTH 


GDB:1 19522 


PARATHYROID HORMONE; PTH 


PTS 


GDB:118856 


6-@PYRUVOYLTETRAHYDROPTERIN 
SYNTHASE; PTS 






ECTODERMAL DYSPLASIA, CLEFT LIP AND 
PALATE, HAND AND FOOT DEFORMITY, 
DYSPLASL\, MARGARITA TYPE 
POUOVIRUS RECEPTOR RELATED; PVRR 


PYGM 


GDB: 120329 


GLYCOGEN STORAGE DISEASE V 


RAGl 


GDB: 120334 


RECOMBINATION ACTIVATING GENE-1; 
RAGl 


RAG2 


GDB:125186 


RECOMBINATION ACTIVATING GENE-2; 
RAG2 


ROMl 


GDB: 120350 


ROD OUTER SEGMENT PROTEIN-1; ROMl 


SAAl 


GDB: 120364 


SERUM AMYLOID Al; SAAl 


SCA5 


GDB:378219 


SPINOCEREBELLAR ATAJOA 5; SCA5 


SCZD2 


GDB: 11 8874 


DISORDER-2; SCZD2 


SDHD 


GDB: 132456 


PARAGANGLIOMAS, FAMILIAL 
NONCHROMAFFIN. 1; PGLl 


SERPINGl 


GDB: 119041 


ANGIONEUROTIC EDEMA, HEREDITARY; 
HANE 
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oik jrT%r\ t 

SMPDI 






IClRCjl 






TCL2 


GDB:9954468 


LEUKEMIA, ACUTE T-CELL; ATL 


TECTA 


GDB:6837718 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 8; 
DFNA8 DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 12; DFNAi2 


TH 


GDB: 119612 


TYROSINE HYDROXYLASE; TH 


TREH 


GDB:9958953 


T^TIT^TT AT A O T** 

TREHALASE 


TSGlOl 


GDB:1313414 


TUMOR SUSCEPTIBILI 1 Y CjEinE 101; loolOl 


TYR 


GDB: 120476 


A T TITXTTO"* M T 

ALBINISM I 


USHIC 


GDB: 132544 


USHER SYNDROME, TYPE IC; USHIC 


VMD2 


GDB: 133795 


VITELLIFORM MACULAR DYSTROPHY; 
VMD2 


VRNI 


GDB: 135662 


VITREORETINOPATHY, NEOVASCULAR 
UNrLAMMAHJKi; VKlNl 


WTl 


GDB: 120496 


FRASIER SYNDROME WILMS TUMOR; WTl 


WT2 


GDB: 118886 


MULTIPLE TUMOR ASSOCL\TED 
CHROMOSOME REGION 1; MTACRl 


ZNF145 


GDB:230064 


PROMYELOCYTIC LEUKEMIA ZINC 
FINGER; PLZF 
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Table 14: Genes, Locations and Genetic Disorders on Chromosome 12 



Gene 


GDB Accession ID 


OMIM Link 


A2M 


GDB: 119639 


ALPHA-2-MACROGLOBULIN; A2M 


AAAS 


GDB:9954498 


GLUCOCORTICOID DEFICIENCY AND 
ACHALASL\ 


ACADS 


GDB: 118959 


ACYL-CoA DEHYDROGENASE. 
SHORT-CHAIN; ACADS 


ACLS 


GDB: 136346 


ACROCALLOSAL SYNDROME; ACLS 


ACVRLl 


GDB:230240 


OSLER-RENDU-WEBER SYNDROME 2; 
ORW2 ACTIVIN A RECEPTOR, TYPE 
n-LKE KINASE 1; ACVRLl 


ADHR 


GDB:9954488 


VITAMIN D-RESISTANT RICKETS, 
AUTOSOMAL DOMINANT 


ALDH2 


GDB: 119668 


ALDEHYDE DEHYDROGENASE-2; 
ALDH2 


AMHR2 


GDB:696210 


ANTI-MULLERIAN HORMONE TYPE II 
RECEPTOR; AMHR2 


AOM 


GDB:1 18998 


STICKLER SYNDROME, TYPE I; STLl 


AQP2 


GDB: 141 853 


AQUAPORIN-2; AQP2 DIABETES 
INSIPIDUS, RENAL TYPE DIABETES 
INSIPIDUS, RENAL TYPE, 
AUTOSOMAL RECESSIVE 


ATD 


GDB:696353 


ASPHYXIATING THORACIC 
DYSTROPHY; ATD 
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ATP2A2 


GDB:U9717 


ATPase, Ca(2+)-TRANSPORTING, 
SLOW-TWITCH; ATP2A2 
DARIER-WHITE DISEASE; DAR 


BDC 


GDB:5584359 


BRACHYDACTYLY, TYPE C; BDC 


CIR 


GDB: 119729 


COMPLEMENT COMPONENT-Clr, 
DEFICIENCY OF 


CD4 


GDB:119767 


T-CELL ANTIGEN T4/LEU3; CD4 


CDK4 


GDB:204022 


CYCLIN-DEPENDENT KINASE 4; CDK4 


CNAl 


GDB:252119 


CORNEA PLANA 1; CNAl 


COL2A1 


GDB: 119063 


STICKLER SYNDROME, TYPE I; STLl 
COLLAGEN, TYPE II, ALPHA- 1 CHAIN; 
COL2A1 ACHONDROGENESIS, TYPE 
II; ACG2 


CYP27B1 


GDB:9835730 


PSEUDOVITAMDSr D DEFICIENCY 
RICKETS; PDDR 


DRPLA 


GDB:270336 


DENTATORUBRAL-PALLIDOLUYSLAN 
ATROPHY; DRPLA 


ENUR2 


GDB:666422 


ENURESIS, NOCTURNAL, 2; ENUR2 


FEOMl 


GDB:345037 


FIBROSIS OF EXTRAOCULAR 
MUSCLES, CONGENITAL; FEOM 


FPF 


GDB:9848880 


PERIODIC FEVER, AUTOSOMAL 
DOMINANT 


GNB3 


GDB: 120005 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, BETA POLYPEPTIDE-3; 
GNB3 
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GNS 


GDB: 120006 


1 «- 11 1 " ■• -.1 m.nT .mf .- irm .1- .r t- --n- 

MUCOPOLYSACCHARIDOSIS TYPE 
HID 


HAL 


GDB: 120746 


HISTIDINEMIA 


HBPl 


GDB:701889 


BRACHYDACTYLY WITH 
HYPERTENSION 


HMGIC 


GDB:362658 


HIGH MOBILITY GROUP PROTEIN 
ISOFORM I-C; HMGIC 


HMN2 


GDB:9954508 


MUSCULAR ATROPHY, ADULT 
SPINAL 


HPD 


GDB:135978 


TYROSINEML\, TYPE III 


IGFl 


GDB: 120081 


INSULINLIKE GROWTH FACTOR 1; 
IGFl 


KCNAl 


GDB: 127903 


POTASSIUM VOLTAGE-GATED 
CHANNEL, SHAKER-RELATED 
SUBFAMILY, MEMBER 


KERA 


GDB:252121 


CORNEA PLANA 2; CNA2 


KRAS2 


GDB:120120 


V-KI-RAS2 KIRSTEN RAT SARCOMA 2 
VIRAL ONCOGENE HOMOLOG; 
KRAS2 


KRTl 


GDB:128198 


KERATIN 1;KRT1 


KRT2A 


GDB:407640 


ICHTHYOSIS, BULLOUS TYPE 
KERATIN 2A; KRT2A 


KRT3 


GDB: 136276 


KERATIN 3; KRT3 
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KRT4 


GDB: 120697 


KERATIN 4; KRT4 


KRT5 


GDB:128110 


EPIDERMOLYSIS BULLOSA 
HERPETIFORMIS, DOWLING-MEARA 


KRT6A 


GDB:12811l 


KERATIN 6A; KRT6A 


KRT6B 


GDB:128113 


KERATIN 6B; KRT6B PACHYONYCHIA 
CONGENITA, JACKSON-LA WLER 
TYPE 


KRTHB6 


GDB:702078 


MONILETHRIX KERATIN, HAIR BASIC 
(TYPE II) 6 


LDHB 


GDB:120147 


LACTATE DEHYDROGENASE-B; 
LDHB 


LYZ 


GDB:120160 


AMYLOIDOSIS, FAMILIAL VISCERAL 






TTT CTT/^T TT A "D TT TTi>f /^I> C 

1 xlo 1 ICU J^AK i UJVlvJKo 








MVK 


GDB: 134 189 


MEVALONICACmURIA 


MYL2 


GDB: 128829 


MYOSIN, LIGHT CHAIN. 


NSl 


GDB:439388 


NOONAN SYNDROME 1; NSl 


OAP 


GDB: 120245 


OSTEOARTHROSIS, PRECOCIOUS; 
OAP 
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PAH 


GDB:119470 


PHENYLKETONURIA; PKUl 


PPKB 


GDB:696352 


PALMOPLANT AR KERATODERMA, 
BOTHNIAN TYPE; PPKB 


PRB3 


GDB: 119513 


PAROTID SALIVARY 
GLYCOPROTEIN; Gl 


PXRl 


GDB:433739 


ZELLWEGER SYNDROME; ZS 
PEROXISOME RECEPTOR 1; PXRl 


RLS 


GDB: 1 1501392 


ACROMELALGM., HEREDITARY 


RSN 


GDB:139158 


RESTIN; RSN 


SAS 


GDB: 128054 


SARCOMA AMPLIFIED SEQUENCE; 
SAS 


SCA2 


GDB: 128034 


SPINOCEREBELLAR ATAXIA 2; SCA2 
ATAXIN-2; ATX2 


SCNNIA 


GDB:366596 


SODIUM CHANNEL, 
NONVOLTAGE-GATED, 1; SCNNIA 


SMAL 


GDB:9954506 


SPINAL MUSCULAR ATROPHY, 
CONGENITAL NONPROGRESSIVE, OF 
LOWER LIMBS 


SPPM 


GDB:9954502 


SCAPULOPERONEAL MYOPATHY; 
SPM 


SPSMA 


GDB:9954510 


SCAPULOPERONEAL AMYOTROPHY, 
NEUROGENIC, NEW ENGLAND TYPE 


TBX3 


GDB:681969 


ULNAR-MAMMARY SYNDROME; 
UMS T-BOX 3; TBX3 
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TBX5 


GDB:6175917 


HOLT-ORAM SYNDROME; HQS T-BOX 
5; TBX5 


TCFl 


GDB: 125297 


TRANSCRIPTION FACTOR 1 , 
HEPATIC; TCFl MATURITY-ONSET 
DIABETES OF THE YOUNG. TYPE III; 
MODY3 


TPIl 


GDB: 119617 


TRIOSEPHOSPHATE ISOMERASE 1; 
Irll 


TSC3 


GDB: 127930 




ULR 


GDB:594089 


UTERINE 


VDR 


GDB: 120487 


VITAMIN D-RESISTANT RICKETS 
WITH END-ORGAN 
UNRESPONSIVENESS TO 
1 ,25-DmYDROXYCHOLECALCIFEROL 


VWF 


GDB:119125 


VON WILLEBRAND DISEASE; VWD 



Table 15: Genes, Locations and Genetic Disorders on Chromosome 13 



Gene 


GDB Accession ID 


OMIM Link 


ATP7B 


GDB: 120494 


WILSON DISEASE; WND 


BRCA2 


GDB:387848 


BREAST CANCER 2, EARLY-ONSET; BRCA2 


BRCDl 


GDB:9954522 


BREAST CANCER, DUCTAL, 1; BRCDl 


CLN5 


GDB:230991 


CEROID-LIPOFUSCNOSIS, NEURONAL 5; 
CLN5 
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CPB2 


GDB: 129546 


CARBOXYPEPTIDASE B2, PLASMA; CPB2 


ED2 


GDB:9834522 


ECTODERMAL DYSPLASIA, HIDROTIC; 
HED 


EDNRB 


GDB: 129075 


ENDOTHELIN-B RECEPTOR; EDNRB 


ENURl 


GDB:594516 


ENURESIS, NOCTURNAL, 1; ENURl 


ERCC5 


GDB: 1205 15 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 5; 
ERCC5 


FIO 


GDB: 119890 


X, QUANTITATIVE VARIATION IN FACTOR 


F7 


GDB: 119897 


FACTOR Vn DEFICIENCY 


GJB2 


GDB: 125247 


GAP JUNCTION PROTEIN, BETA-2, 26 KD; 
GJB2 DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 1 ; DFNBl 
DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 3; 
DFNA3 


GJB6 


GDB:9958357 


ECTODERMAL DYoFLAblA, rlUJKUl IC; 
HED DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 3; 


urr 1 




rMQTTT rw PPOA/fOTFR FAf^TOR 1 • TPFl 
LLNdUl^ixN JrXvv-FlYlv-f I Civ 1 1 , li r 1 




GDB- 128365 


MOEBIUS SYNDROME- MBS 


MCOR 


GDB:9954520 


CONGENITAL 
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PCCA 


GDB:1 19473 


GLYCINEMIA, KETOTIC, I 


RBI 


GDB: 118734 


BLADDER CANCER RETINOBLASTOMA; 

1713 1 
KJUl 


KHUK. 




D lJ/^I~»^~»DO nvT V TXT A CC • 1> UOV 

KiiUlJUroiN JvUnAoii, KlUJJs. 


SCZD7 


GDB:9864734 


DISORDER-2; SCZD2 


SGCG 


GDB:3763329 


MUSCULAR DYSTROPHY, LIMB GIRDLE, 
TYPE 2C; LGMD2C 


SLC10A2 


GDB:677534 


SOLUTE CARRIER FAMILY 10, MEMBER 2; 
SLC10A2 


SLC25A15 


GDB: 120042 


HYPERORNITHINEMIA-HYPERAMMONEM 
L\-HOMOCITRULLINURIA SYNDROME 


STARPl 


GDB:635459 


STEROIDOGENIC ACUTE REGULATORY 


ZNF198 


GDB:6382650 


ZINC FINGER PROTEIN-198; ZNF198 



Table 16: Genes, Locations and Genetic Disorders on Chromosome 14 



Gene 


GDB Accession ID 


OMIM Link 


ACHMl 


GDB:132458 


COLORBLINDNESS, TOTAL 


ARVDl 


GDB:371339 


ARRHYTHMOGENIC RIGHT VENTRICULAR 
DYSPLASIA, FAMILIAL, 1; ARVDl 


CTAAl 


GDB:265299 


CATARACT, ANTERIOR POLAR 1; CTAAl 


DADl 


GDB:407505 


DEFENDER AGAINST CELL DEATH; DADl 
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DFNB5 


GDB:636176 


DEAFNESS, NEUROSENSORY, AUTOSOMAL 








GALC 


GDB: 119970 


KRABBE DISEASE 


GCHl 


GDB: 118798 


DYSTONIA, PROGRESSIVE, WITH DIURNAL 
VARIATION GTP CYCLOHYDROLASE I 
DEFICffiNCY GTP CYCLOHYDROLASE I; 
GCHl 


HEl 


GDB:9957680 


MALFORMATIONS, MULTIPLE, WITH LIMB 
ABNORMALITIES AND HYPOPITUITARISM 


IBGCl 


GDB: 10450404 


CEREBRAL CALCIFICATION, 
NONARTERIOSCLEROTIC 


1GH@ 


GDB: 118731 


IgA CONSTANT HEAVY CHAIN 1; IGHAl 
IMMUNOGLOBULIN: D (DIVERSITY) 
REGION OF HEAVY CHAIN IgA CONSTANT 
HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: J (JOINING) LOCI OF 
HEAVY CHAIN; IGHJ IMMUNOGLOBULIN: 
HEAVY Mu CHAIN; Mul; IGHMl 
IMMUNOGLOBULIN: VARL\BLE REGION OF 
HEAVY CHArNS--Hvl; IGHV IgG HEAVY 
CHAIN LOCUS; IGHGl IMMUNOGLOBULIN 
Gm-2; IGHG2 IMMUNOGLOBULIN Gm-3; 
IGHG3 IMMUNOGLOBULIN Cjm-4; ICjHG4 
IMMUNOGLOBULIN: HEAVY DELTA CHAIN; 
IGHD IMMUNOGLOBULIN: HEAVY EPSILON 
CHAIN; IGHE 


IGHC group 


GDB:9992632 


IgA CONSTANT HEAVY CHAIN 1; IGHAl IgA 
CONSTANT HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: HEAVY Mu CHAIN; 
Mul; IGHMl IgG HEAVY CHAIN LOCUS; 

IMMUNOGLOBULIN Gm-3; IGHG3 
IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA CHAIN; 
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IGHD IMMUNOGLOBULIN: HEAVY EPSILON 


IGHGl 


GDB: 120085 


IgG HEAVY CHAIN LOCUS; IGHGl 


IGHM 


GDB: 120086 


IMMUNOGLOBULIN: HEAVY Mu CHAIN; 
Mul; IQjrlMl 


lOHR 




LiI(Al) bYlNlDKUMJi 


IV 


GDB: 139274 


INVERSUS VISCERUM 


LTBP2 


GDB:453890 


LATENT TRANSFORMING GROWTH 

r AC 1 UK-oii 1 A oLNJLllJN vjr FKU 1 tills Z, L,l lirZ 










vjDorl Ioo4U 


MACHALIU-JUoiirrl JJlbilAatt; MJU 


MNCjI 


CjiJB:o54Uuo2 


CiUllbK, MULl IinUJJULAK 1; MNUl 


Mx'iJl 


/^"r\D .-Tl AO*?! 


M YLirAlrlY, i^Ali^ JJlol AL rlbKjbJJll AKY 






MUCL>rL^l-YoAv^v^llAJ<JiJL>51o 1 Yx'll lllC 


MYH6 


GDB:120214 


MYOSIN, HEAVY POLYPEPTIDE 6; MYH6 


MYH7 


GDB: 1202 15 


MYOSIN, CARDL\C, HEAVY CHAIN, BETA; 
AyrvTT7 

IVl I XI / 


NP 


GDB: 120239 


NUCLEOSIDE PHOSPHORYLASE; NP 


PABPNl 


GDB:567135 


OCULOPHARYNGEAL MUSCULAR 
DYSTROPHY; OPMD OCULOPHARYNGEAL 
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MUSCULAR DYSTROPHY, AUTOSOMAL 
RECESSIVE POLYADENYLATE-BINDING 
PROTEIN-2; PABP2 


PSENl 


GDB: 135682 


ALZHEIMER DISEASE. FAMILIAL, TYPE 3; 


PYGL 




OT vrT^nPlM QTrjR AGE DISEASE VI 


RPGRIPl 


{jDo: I I4yi> /oo 


ATVyfATrpriQTQ PnisinFTSTTTA OF T FRFR T 


SERPINAl 




JrlvvJ 1 i1,J\^Cj iiNri-LDl 1 \JIS. 1 , X A 


SERPINA3 


GDB: 118955 


ALPHA-l-ANTICHYMOTRYPSIN; AACT 


SERPINA6 


GDB: 127865 


CORTICOSTEROID-BINDING GLOBULIN; 


SLC7A7 


GDB:9863033 


DIBASICAMINOACIDURIA U 


SPG3A 


GDB:230126 


SPASTIC PARAPLEGIA-3, AUTOSOMAL 
DOMINANT; SPG3A 


SPTB 


GDB: 119602 


ELLIPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC SPECTRIN, 
RFTA FWYTHTinPYTTP- SPTB 

DCi X CiXv X X XXXVW X X X^, ox X XJ 


TCLIA 


GDB:250785 


T-CELL LYMPHOMA OR LEUKEMIA 


TCRAV17S1 


GDB:642130 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNTT; TCRA 


TCRAV5S1 


GDB:451966 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNTT; TCRA 
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TGMl 


GDB: 125299 


TRANSGLUTAMINASE 1; TGMl ICHTHYOSIS 
CONGENITA 


TITFl 


GDB: 132588 


THYROID TRANSCRIPTION FACTOR 1; TITFl 


TMIP 


GDB:9954523 


AND ULNA, DUPLICATION OF, WITH 
ABSENCE OF TIBIA AND RADIUS 


TRA@ 


GDB: 120404 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TSHR 


GDB: 1253 13 


THYROTROPIN, UNRESPONSIVENESS TO 


USHIA 


GDB: 11 8885 


USHER SYNDROME, TYPE lA; USHIA 


VP 


GDB: 120492 


PORPHYRIA VARIEGATA 



Table 17: Genes, Locations and Genetic Disroders on Chromosome 15 



Gene 


GDB Accession ID 


OMIM Link 


ACCPN 


GDB:5457725 


CORPUS CALLOSUM, AGENESIS OF, WITH 
NEURONOPATHY 


AH02 


GDB:9954535 


HEREDITARY OSTEODYSTROPHY-2; 
AH02 


ANCR 


GDB: 119678 


ANGELMAN SYNDROME 


B2M 


GDB: 119028 


BETA-2-MICROGLOBULIN; B2M 


BBS4 


GDB:511199 


BARDET-BIEDL SYNDROME, TYPE 4; 
BBS4 


BLM 


GDB: 135698 


BLOOM SYNDROME; BLM 
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CAPN3 


GDB:119751. 


CALPAIN, LARGE POLYPEPTIDE L3; 

CAPN3 MUSCULAR DYSTROPHY, 
LIMB-GERDLE, TYPE 2; LGMD2 


CDANl 


GDB:9823267 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE I 


CDAN3 


GDB:386192 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE III; CDAN3 


CLN6 


GDB:4073043 


CEROID-LIPOFUSCINOSIS, NEURONAL 6, 
LATE INFANTILE, VARIANT; CLN6 


CMH3 


GDB: 138299 


CARDIOMYOPATHY, FAMILL\L 
HYPERTROPHIC. 3; CMH3 


CYP19 


GDB: 119830 


CYTOCHROME P450, SUBFAMILY XIX; 
CYP19 


CYPlAl 


GDB: 120604 


CYTOCHROME P450, SUBFAMILY I, 
POLYPEPTIDE 1; CYPlAl 


CYP1A2 


GDB: 118780 


CYTOCHROME P450, SUBFAMILY I, 

POT VPPPTTFlF 0' r'VPl A 7 


DYXl 


GDB: 1391 796 


DYSLEXIA, SPECIFIC, I; DYXl 


EPB42 


GDB: 127385 


HEREDITARY HEMOLYTIC PROTEIN 4.2, 
FRVTRROPYTrP' FPR42 




\JLft3. 1171^1 


ni TITARTPAPTnilRIA IIA- GA IIA 




GDB-4590306 


EYE COLOR-3* EYCL3 


FAH 


GDB: 119901 


TYROSINEMDV, TYPE I 
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FBNl 


GDB:127115 


FIBRlLLIN-1; FBNl MARFAN SYNDROME; 
MFS 


FES 


GDB: 119906 


V-FES FELINE SARCOMA VIRAiyV-FPS 
FUJINAMI AVIAN SARCOMA VIRAL 
ONCOGENE 


HCVS 


GDB: 119306 


CORONAVIRUS 229E SUSCEPTIBILITY; 


Til • V A 

HEXA 


GDB: 120040 


1 AY-S ACxlo JJloJDAbii; 1 ou 


rvD 


GDB: 119354 


ISOVALERICACIDEMIA; IVA 


LCSl 


GDB: 11500552 


CHOLESTASIS-LYMPHEDEMA 
o Y In JJKUMJb 


LIPC 


vjDB.I lyioo 


T TD A CC UCDAT*?/^. T TO/^ 

LrlFAot, rlcr'A 1 IC, LLrU 


MY05A 


CjDB:218824 


MYUolN VA; MYU^A 


OCA2 


GDB: 136820 


A T "DTKTTOliiT TT 


DISCI 


vjUd :yoo0473 


D 1 V-lol-^LiCKvJolo 


PWCR 


GDB: 120325 


PRADER-WILLI SYNDROME 


RLBPl 


GDB:127341 


RETIN ALDEHYDE-BINDING PROTEIN 1,; 
RLBPl 


SLC12A1 


GDB:386121 


SOLUTE CARRIER FAMILY 12, MEMBER 1; 
SLC12AI 


SPG6 


GDB:5 11201 


SPASTIC PARAPLEGIA 6, AUTOSOMAL 
DOMINANT; SPG6 
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TPMl 


GDB: 127875 


TROPOMYOSIN 1; TPMl 


UBE3A 


GDB:228487 


ANGELMAN SYNDROME 
UBIQUITIN-PROTEIN LIGASE E3 A; UBE3A 


WMS 


GDB:5583902 


WEILL-MARCHESANl SYNDROME 



Table 18: Genes, Locations and Genetic Disorders on Chromosome 16 



Gene 


GDB Accession ID 


OMIM Link 






PSEUDOXANTHOMA ELASTICUM. 
AUTOSOMAL DOMINANT; PXE 
PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL RECESSIVE; PXE 


ALDOA 


GDB: 11 8993 


ALDOLASE A, FRUCTOSE-BISPHOSPHATE; 
ALDOA 


APRT 


GDB: 119003 


ADENINE 

PHOSPHORIBOSYLTRANSFERASE; APRT 


ATP2A1 


GDB:1 19716 


ATPase. Ca(2+)-TRANSPORTING, 
FAST-TWITCH 1; ATP2A1 BRODY 
MYOPATHY 


BBS2 


GDB:229992 


BARDET-BIEDL SYNDROME, TYPE 2; BBS2 


CARD15 


GDB: 11026232 


SYNOVITIS, GRANULOMATOUS. WITH 
UVEITIS AND CRANIAL NEUROPATHIES 
REGIONAL ENTERITIS 


CATM 


GDB:701219 


MICROPHTHALMIA-CATARACT 


CDHl 


GDB: 120484 


CADHERJN 1;CDH1 
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CETP 


GDB: 119773 


CHOLESTERYL ESTER TRANSFER PROTEIN, 


CHST6 


GDB:131407 


CORNEAL DYSTROPHY, MACULAR TYPE 


CLN3 


GDB: 120593 


CEROID-LIPOFUSCINOSIS, NEURONAL 3, 
JUVENILE; CLN3 


CREBBP 


GDB:437I59 


RUBINSTEIN SYNDROME CREB-BINDING 

rxCvJlCilM, K^JSJCDiSr 


Clrl 


KjDd.I lyuoo 


C Y o 1 A 1 illLIJN IXN U KIA 


CTM 


GDB: 119819 


CATARACT, ZONULAR 


CYBA 


GDB: 125238 


GRANULOMATOUS DISEASE, CHRONIC, 
AUTOSOMAL CYTOCHROME-b-NEGATIVE 
FORM 


CYLD 


GDB:701216 


EPITHELIOMA, HEREDITARY MULTIPLE 
JbJJiiNHjrlN L/YollL/ 


Drib 


LrJJo . y y J oZOo 




UIn AoJb 1 


ljriJi5:l Ji^o4o 




DPEPl 


GDB: 128059 


RENAL DIPEPTIDASE 


ERCC4 


GDB:ll91i3 


EXCISION-REPAIR, COMPLEMfaN TING 
DEFECTIVE, IN CHINESE HAMSTER, 4; 
ERCC4 XERODERMA PIGMENTOSUM, 
COMPLEMENTATION GROUP F; XPF 


FANCA 


GDB:701221 


FANCONI ANEMIA, COMPLEMENTATION 
GROUP A; FACA 
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GALNa 




MTTrTiPrjl V<lAr'CHARID05?TS TYPE IV A 


GAN 


GDB:9864885 


NEUROPATHY, GIANT AXONAL; GAN 


HAGH 


GDB: 119292 


HYDROXY ACYL GLUTATHIONE 

n X \Ji\\J\^f\^3Z>^ nr\.vjrx 


HBAl 


GDB: 11 9293 


TTT7\yfr\r:T nuTW at PMA T 001^-1 • HRA 1 

rlXlJVlVJVjJLlJolfN — rvLfJr JIA J-^WV^ \J o- 1 ^ rXDr\ 1 


HBA2 


GDB: 119294 


HEMOGLOBIN-ALPHA LOCUS-2; HBA2 


HBHR 


GDB:9954541 


HEMOGLOBIN H-RELATED MENTAL 

JSJl 1 AKJLi A 1 iKJri 


HBQl 


GDB: 120036 


rlJ2lVl\-rOLVjx3lIN — 1 FLU 1 £\r I X-»\-/V^ U O , rXD\^ 1 


HBZ 


GDB: 1 19302 




HBZP 


GDB: 120037 




HP 


GDB:1 19314 


HAPTOGLOBIN; HP 


HSD11B2 


GDB:409951 


CORTISOL 1 1-BETA-KETOREDUCTASE 


TT AT^ 

IL4R 


ODo: L l<SoZJ 




LLPB 


ODt5. 1 ly^OD 






GDB- 135 162 


MELANOCORTIN-l RECEPTOR; MCIR 


MEFV 


GDB:125263 


MEDITERRANEAN FEVER, FAMILIAL; MEFV 
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MHC2TA 


GDB:6268475 


MHC CLASS U TRANSACTIVATOR; MHC2TA 


MLYCD 


GDB: 11500940 


MALONYL CoA DECARBOXYLASE 
DEFICIENCY 


PHKB 


GDB: 120286 


PHOSPHORYLASE KINASE, BETA SUBUNIT; 
PHKB 


PHKG2 


GDB: 1403 16 


PHOSPHORYLASE KINASE, TESTIS/LIVER, 
GAMMA 2; PHKG2 


PKDl 


GDB: 120293 


POLYCYSTIC KIDNEYS POLYCYSTIC 
KIDNEY DISEASE 1; PKDl 


PKDTS 


GDB:9954545 


POLYCYSTIC KIDNEY DISEASE, INFANTILE 
SEVERE, WITH TUBEROUS SCLEROSIS; 


PMM2 


GDB:438697 


CARBOHYDRATE-DEFICIENT 
GLYCOPROTEIN SYNDROME, TYPE I; CDGl 
PHOSPHOMANNOMUTASE 2; PMM2 


PXE 


GDB:6053895 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 
PSEUDOXANTHOMA ELASTICUM. 
AUTOSOMAL RECESSIVE; PXE 


SALLl 


GDB:4216161 


TOWNES-BROCKS SYNDROME; TBS 

CAT T TVC 1 • C A T T 1 


SCA4 


GDB:250364 


SPINOCEREBELLAR ATAXIA 4; SCA4 


SCNNIB 


GDB:434471 


SODIUM CHANNEL. NONVOLTAGE-GATED 
1 BETA; SCNNIB 


SCNNIG 


GDB:568759 


SODIUM CHANNEL, NONVOLTAGE-GATED 
1 GAMMA; SCNNIG 
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TAT GDB : 1 20398 TYROSINE TRANSAMINASE DEFICIENCY 



TSC2 GDB: 120466 TUBEROUS SCLEROSIS-2; TSC2 



VDI GDB: 1 1 9629 DEFECTIVE INTERFERING PARTICLE 

INDUCTION, CONTROL OF 



WT3 GDB:9958957 WILMS TUMOR, TYPE UI; WT3 



Table 19: Genes, Locations and Genetic Disorders on Chromosome 17 



Gene 



ABR 



GDB Accession ID 



GDB: 119642 



OMIM Link 



ACTIVE BCR-RELATED GENE; ABR 



ACACA 



GDB:120534 



ACETYL-CoA CARBOXYLASE 
DEFICIENCY 



ACADVL 



GDB:1248185 



ACYL-CoA DEHYDROGENASE, 
VERY-LONG-CHAIN, DEFICIENCY OF 



ACE 



GDB: 119840 



DIPEPTIDYL CARBOXYPEPTIDASE-1; 
DCPl 



ALDH3A2 



APOH 



GDB: 1316855 



SJOGREN-LARSSON SYNDROME; SLS 



GDB: 11 8887 



APOLIPOPROTEIN H; APOH 



ASPA 



GDB:231014 



SPONGY DEGENERATION OF CENTRAL 
NERVOUS SYSTEM 



AXIN2 



GDB:9864782 



CANCER OF COLON 



BCL5 



GDB:125178 



LEUKEMLVLYMPHOMA, CHRONIC 
B-CELL, 5; BCL5 
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BHD 


GDB: 11 498904 


WITH TRICHODISCOMAS AND 


OJLJVLrl 






BRCAl 


GDB: 1266 11 


BREAST CANCER, TYPE 1; BRCAl 


CACD 


GDB:5885801 


CHOROIDAL DYSTROPHY, CENTRAL 
AREOLAR; CACD 


CCAl 


GDB: 118763 


CATARACT, CONGENITAL. CERULEAN 
TYPE 1; CCAl 


CCZS 


GDB:681973 


CATARACT, CONGENITAL ZONULAR, 
WITH SUTURAL OPACITIES; CCZS 


CHRNBl 


GDB: 120587 


CHOLINERGIC RECEPTOR, NICOTINIC, 
BETA POLYPEPTIDE 1 ; CHRNBl 


CHRNE 


GDB: 132246 


CHOLINERGIC RECEPTOR, NICOTINIC, 
EPSILON POLYPEPTIDE; CHRNE 


CMTIA 


GDB: 119785 


CHARCOT-MARIE-TOOTH DISEASE, TYPE 
1 A; CMTIA NEUROPATHY. HEREDITARY. 
WITH LIABILITY TO PRESSURE PALSIES; 
HNPP 


COLlAl 


GDB: 119061 


COLLAGEN, TYPE I, ALPHA- 1 CHAIN; 
COLlAl OSTEOGENESIS IMPERFECTA 
TYPE I OSTEOGENESIS IMPERFECTA 

1 I I'll IV, \J14 


CORDS 


GDB:568473 


CONE-ROD DYSTROPHY-5; CORDS 


CTNS 


GDB:700761 


CYSTINOSIS. EARLY-ONSET OR 
INFANTILE NEPHROPATHIC TYPE 
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EPX 


GDB:377700 


EOSINOPHIL PEROXIDASE; EPX 


ERBB2 


GDB:120613 


V-ERB-B2 AVIAN ERYTHROBLASTIC 
LEUKEML\ VIRAL ONCOGENE HOMOLOG 
2; ERBB2 


G6PC 


GDB:231927 


GLYCOGEN STORAGE DISEASE I; GSD-I 


GAA 


GDB: 119965 


GLYCOGEN STORAGE DISEASE II 


GALKl 


GDB:119246 


GALACTOKINASE DEFICIENCY 


GCGR 


GDB:304516 


GLUCAGON RECEPTOR; GCGR 


GFAP 


GDB: 11 8799 


GLIAL FIBRILLARY ACIDIC PROTEIN; 
GFAP ALEXANDER DISEASE 


GHl 


GDB: 119982 


GROWTH HORMONE 1; GHl 


GH2 


GDB: 119983 


GROWTH HORMONE 2; GH2 


GPIBA 


GDB: 118806 


GIANT PLATELET SYNDROME 


GPSC 


GDB:9954564 


FAMILL\L PROGRESSIVE SUBCORTICAL 


GUCY2D 


GDB:136012 


AMAUROSIS CONGENITA OF LEBER I 
GUANYLATE CYCLASE 2D, MEMBRANE; 
GUC2D CONE-ROD DYSTROPHY-6; CORD6 


ITGA2B 


GDB:120012 


THROMBASTHENIA OF GLANZMANN 
AND NAEGELI 


ITGB3 


GDB:120013 


INTEGRIN, BETA-3; ITGB3 


ITGB4 


GDB: 128028 


INTEGRIN, BETA-4; n'GB4 
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KRTIO 


GDB:118828 


KERATIN 10; KRTIO 


KRT12 


GDB:5583953 


CORNEAL DYSTROPHY, JUVENILE 
EPITHELIAL, OF MEESMANN KERATIN 12; 


KRT13 


GDB: 120740 


KERATIN 13; KRT13 


KRT14 


GDB:132145 


KERATIN 14; KRT14 GLUTATHIONE 

O 1 IN I Fill 1 /\k^JD, VJOO 


J\Jv 1 1 i 




AXTXT 1/1- VUX1>1 




rSFiR-l 901 00 

yjxjD . i 1 








1^T71? ATTM 1 A' T^TJTl A 


Ill/ 




fPP ATTM 1 T^PTl 
JSJQJx/\lXiN ID, JVtvl iO 




r^np-i90i9^ 


1^T7P ATTM 1 VDTl 
JSJDlv/Y i ilN lO, JVlvl iO 


KRT16L2 


GDB: 120126 


KERATIN 16; KRT16 


KRT17 


GDB:136211 


KERATIN 17; KRT 17 PACHYONYCHIA 
CONGENITA, JACKSON-LA WLER TYPE 


KRT9 


GDB:3O3970 


HYPERKERATOSIS, LOCALIZED 
EPIDERMOLYTIC 


MAPT 


GDB: 119434 


MICROTUBULE-ASSOCIATED PROTEIN 
lAU, MArl JrAJ-LlJUUrUNlUNlUKAL 
DEGENERATION; PPND 
DISINHIBITION-DEMENTIA-PARKINSONIS 
M-AMYOTROPHY COMPLEX; DDPAC 


MDB 


GDB:9958959 


MEDULLOBLASTOMA; MDB 
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MDCR 




lUTT T FT? niFKER LISSENCFPHALY 
SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 


MGI 


GDB:9954550 


MYASTHENIA GRAVIS, FAMILIAL 
INFANTILE; FIMG 


MHS2 


GDB: 132580 


MALIGNANT HYPERTHERMIA 
T^r'KPTTRTT TTY-2' MHS2 


MKSl 


vjiJxi^ooiyD/ 




MPO 




lUVFT OPFROXTDASE DEFICIENCY 


MUL 


GDB:636050 


MULIBREY NANISM; MUL 


MY015A 


GDB:9838006 


DEAFNESS, NEUROSENSORY, 
ATITO^OMAT RECESSIVE 3* DFNB3 


NAGLU 


GDB:636533 


MUCOPOLYSACCHARIDOSIS TYPE mB 


NAPB 


GDB:9954572 


NEURITIS WITH BRACHIAL 
PRFDTI ECTTON- NAPB 


KFl 


GDB: 12023 1 


NEUROFIBROMATOSIS, TYPE I; NFl 


NMEl 


GDB: 127965 


NON-MET ASTATIC CELLS I. PROTEIN 
EXPRESSED IN; NMEl 


P4HB 


GDB:120708 


PROLYL-4-HYDROXYLASE, BETA 
POLYPEPTIDE; PHDB; PROHB 




GDB-677430 


MILLER-DIEKER LISSENCEPHALY 
SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 
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PECAMl 


GDB:696372 


PLATELET-ENDOTHELIAL CELL 
ADHESION MOLECULE; PECAMl 


PEX12 


GDB:61 55804 


ZELLWEGER SYNDROME; ZS 


PHB 


GDB: 126600 


PROHIBITIN; PHB 


PMP22 


GDB:134190 


CHARCOT-MARIE-TOOTH DISEASE, TYPE 
lA; CMTl A HYPERTROPHIC 
NEUROPATHY OF DEJERINE-SOTTAS 
PERIPHERAL MYELIN PROTEIN 22; PMP22 


PRKARIA 


GDB: 1203 13 


MYXOMA, SPOTTY PIGMENTATION, AND 
ENDOCRINE OVERACTIVITY PROTEIN 
KINASE, cAMP-DEPENDENT, 
KJ1GUI-.AIOKY, 1 Yrk. 1, ALrriA; FKKAKI A 


PRKCA 


GDB: 12801 5 


PROTEIN KINASE C, ALPHA; PRKCA 


PRKWNK4 


GDB:9954566 


PSEUDOHYPOALDOSTERONISM TYPE U, 

T rf^/^T TC "D. DXT A 

LrUCUo £>; FHA^D 


rKro 


ouo:yy37oy7 


Kh 1 LNl 1 lo rlvjrJVLfcN 1 Uo A- 1 3; Kr 1 3 


r'lvrr o 


tjUo:jy/o4/ 


KJil JJNll lb fiOMxiN lUoA-13; Kr 13 


PTLAH 


GDB:9957342 


APLASL\ OR HYPOPLASIA 


RARA 


GDB: 120337 


RETINOIC ACID RECEPTOR, ALPHA; 

P AT? A 


RCVl 


GDB: 135477 


RECOVERIN; RCVl 


RMSAl 


GDB:304519 


REGULATOR OF MITOTIC SPINDLE 
ASSEMBLY 1; RMSAl 
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RP17 


GDB:683199 


RETDSrmS PIGMENTOSA-17; RP17 


RSS 


GDB:439249 


RUSSELL-SILVER SYNDROME; RSS 


SCN4A 


GDB:125181 


PERIODIC PARALYSIS II 


SERPINF2 


GDB: 120301 


PLASMIN INHIBITOR DEFICIENCY 


SGCA 


GDB:384077 


ADHALIN; ADL 


SGSH 


GDB:1319101 


MUCOPOLYSACCHARIDOSIS TYPE IIIA 


SHBG 


GDB: 125280 


SEX HORMONE BINDING GLOBULIN; 
SHBG 


SLC2A4 


GDB: 1 19997 


SOLUTE CARRIER FAMILY 2, MEMBER 4; 
SLC2A4 


SLC4A1 


GDB: 119874 


SOLUTE CARRIER FAMILY 4, ANION 
EXCHANGER, MEMBER 1; SLC4A1 BLOOD 
GROUP-DIEGO SYSTEM; DI BLOOD 
GROUP-WRIGHT ANTIGEN; Wr 
ELLIPTOCYTOSIS. RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 


SLC6A4 


GDB:134713 


SOLUTE CARRIER FAMILY 6, MEMBER 4; 
SLC6A4 


SMCR 


GDB:120379 


SMITH-MAGENIS SYNDROME; SMS 


SOST 


GDB: 10450629 


SCLEROSTEOSIS 


SOX9 


GDB: 134730 


DYSPLASIA 


SSTR2 


GDB: 1341 86 


SOMATOSTATIN RECEPTOR-2; SSTR2 
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SYMl 


GDB:512174 


SYMPHALANGISM, PROXIMAL; SYMl 


SYNSl 


GDB:9862343 


SYNOSTOSES, MULTIPLE, WITH 
BRACHYDACTYLY 


TCF2 


GDB: 125298 


TRANSCRIPTION FACTOR-2, HEPATIC; 
TCF2 


THRA 


GDB: 120730 


THYROID HORMONE RECEPTOR, ALPHA 
1;THRA 


TIMP2 


GDB: 1326 12 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-2; TIMP2 


TOC 


GDB:451978 


TYLOSIS WITH ESOPHAGEAL CANCER; 
TOC 


TOP2A 


GDB: 11 8884 


TOPOISOMERASE (DNA) II, ALPHA; 
TOP2A 


TP53 


GDB:120445 


CANCER, HEPATOCELLULAR 
LI-FRAUMENI SYNDROME; LFS TUMOR 
PROTEIN p53; TP53 CARCINOMA 


VBCH 


GDB:9954554 


HYPEROSTOSIS CORTICALIS 
GENERALISATA 



Table 20: Genes, Locations and Genetic Disorders on Chromosome 18 



Gene 


GDB Accession ID 


OMIM Link 


ATP8B1 


GDB:453352 


CHOLESTASIS, PROGRESSIVE FAMILIAL 
INTRAHEPATIC 1; PFICl INTRAHEPATIC 
CHOLESTASIS FAMILIAL INTRAHEPATIC 
CHOLESTASIS-!; HCl 


BCL2 


GDB: 119031 


B-CELL CLL/LYMPHOMA 2; BCL2 
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CNSN 






CORDl 


GDB: 118773 


CONE-ROD DYSTROPHY- 1; CORDl 


CYB5 


GDB: 125236 


METHEMOGLOBINEMIA DUE TO 
DEFICIENCY OF CYTOCHROME b5 


DCC 


GDB: 119838 


DELETED IN COLORECTAL CARCINOMA; 
DCC 


F5F8D 


GDB:6919858 


FACTOR V AND FACTOR VIU, COMBINED 


FECH 


GDB:127282 


PROTOPORPHYRL\, ERYTHROPOIETIC 


FEO 


GDB:4378120 


POLYOSTOTIC OSTEOLYTIC DYSPLASIA, 


LAMA3 


GDB:25l8l5 


T A livrTXTTTVT A T T>XJ A 1* 1 A \A A 1 


LCFS2 


GDB:9954578 


CANCER 


MADH4 


GDB:4642788 


POLYPOSIS, JUVENILE INTESTINAL 
MOTHERS AGAINST DECAPENTAPLEGIC, 
DROSOPHILA, HOMOLOG OF, 4; MADH4 


MAFDl 


GDB:120163 


MANIC-DEPRESSIVE PSYCHOSIS, 

AT TTO^nM A T 


MC2R 


GDB: 135 163 


ADRENAL UNRESPONSIVENESS TO ACTH 


MCL 


GDB:9954574 


LEIOMYOMATA, HEREDITARY MULTIPLE, 
OF SKIN 


MYP2 


GDB:9862232 


MYOPIA 
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NPCl 


CjUd.I jol /o 


iNUtMANN-x'lCIv UlotiAoil, 1 Yrb 1^1; NrCI 


SPPK 


GDB:606444 


PALMOPLANTARIS STRIATA 


TGFBRE 


GDB:250852 


TRANSFORMING GROWTH FACTOR, BETA 1 
KlibrONbb bLcJVLciN 1 


ILrlr 






TTR 


GDB:1 19471 


TRANSTHYRETIN; TTR 



Table 21 : Genes, Locations and Genetic Disorders on Chromosome 19 



Gene 


GDB Accession ID 


OMIM Link 


AD2 


GDB: 118748 


ALZHEIMER DISEASE-2; AD2 


AMH 


GDB: 118996 


PERSISTENT MULLERIAN DUCT 
SYNDROME, TYPES I AND H; PMDS 
ANTI-MULLERIAN HORMONE; AMH 


APOC2 


GDB: 119689 


APOLIPOPROTEIN C-H DEFICIENCY, 
TYPE I HYPERLIPOPROTEINEMIA DUE 
TO 


APOE 


GDB: 119691 


APOLIPOPROTEIN E; APOE 


ATHS 


GDB: 128803 


LIPOPROTEIN PHENOTYPE; ALP 


BAX 


GDB:228082 


BCL2-ASSOCIATED X PROTEIN; BAX 


BCKDHA 


GDB: 119723 


MAPLE SYRUP URINE DISEASE 


BCL3 


GDB: 120561 


B-CELL LEUKEMIA/LYMPHOMA-3; BCL3 
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BFIC 


GDB:9954584 


BENIGN FAMILIAL INFANTILE 
CONVULSIONS 


C3 


GDB: 119044 


COMPLEMENT COMPONENT-3; C3 


CACNAIA 


GDB: 126432 


ATAXL\, PERIODIC 

VESTIBULOCEREBELLAR HEMIPLEGIC 
MIGRAINE, FAMILL\L; MHP 

CT»TMOr'TIl?T;RFI T AP ATAXIA 6" SCAfi 

CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, P/Q TYPE. 
ALPHA 1 A SUBUNIT- CACNAIA 






PFNTRAT CORE DISEASE OF MUSCLE 


CEACAM5 


GDB: 11 9054 


CARCINOEMBRYONIC ANTIGEN; CEA 


COMP 


GDB:344263 


COTDtrvCC A T FWCPT AQTA A/fTTTTTPTF* 
tix^Urrl Y oxiAi^ D i oJrl-»/\olA, JVIUI^ 1 Lr JL/li, 

MED PSEUDOACHONDROPLASTIC 
DYSPLASIA CARTILAGE OLIGOMERIC 
MATRIX PROTEIN; COMP 


CRX 


GDB:333932 


AMAUROSIS CONGENITA OF LEBER I 
CONE-ROD HOMEO BOX-CONTAINING 
GENE 


DBA 


GDB:9600353 


ANEMIA, CONGENITAL HYPOPLASTIC, 
OF BLACICFAN AND DIAMOND 


DDU 


GDB: 10796026 


URTICARIA; DDU 


DFNA4 


GDB:606540 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 4; 
DFNA4 


DLLS 


GDB:9959026 


VERTEBRAL ANOMALIES 
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DMPK 


nFfcU- 1 1 ono'? 
OiJo. 1 1 / 


T^VQTROPWTA MVOTOlMir'A- DM 


L/JVl WJJ 


VJiJO. / 1 / OJ 




DPDl 


OJJo. lU /yoi /U 




EUS 


GDB:1 19101 


ECHO 1 1 SENSITIVITY; El IS 


ELA2 


GDB: 118792 


ELASTASE-2; ELA2 NEUTROPENIA, 
CYCLIC 


EPOR 


GDB: 125242 


ERYTHROPOIETIN RECEPTOR; EPOR 


ERCC2 


GDB:119112 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 2; 
ERCC2 XERODERMA PIGMENTOSUM IV; 
XP4 


ETFB 


GDB: 119887 


ELECTRON TRANSFER FLAVOPROTEIN, 
ob 1 A rULirhir i iunj r. 1 rl3 


EXT3 


GDB:383780 


cAUo 1 UollOy MUl^ 1 Lr JLtl, 1 Y ril 111, tiA 1 J 


EYCLl 


GDB: 11 9259 


liYb CULvJK-l, HiCJ-tl 


r I L 




r titvJtsJ. 1 JiN i^ion 1 i^n^rviiN , r ili 


FUTl 


OUJd. IzUoio 


PT Trr^QVT TP AM^FFP A^F-1 • FT TTl 


FUT2 




PI TI^I^C VT TP A WQPPP A ^F-9 ■ FT TT9 




GDB- 135 180 


FUCOSYLTRANSFERASE-6: FUT6 


GAMT 


GDB:1313736 


GUANIDINOACETATE 
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METHYLTRAN or xiKAo£; QjAMI 


GCDH 


GDB: 136004 


GLUTARICACiUfcMlA 1 


GPI 


GDB:120015 


GLUCOSEPHOSPHATE ISOMERASE; GPI 


GUSM 


GDB:1 19291 


GLUCURONIDASE, MOUSE, MODIFIER 
OF; GUSM 


HBl 


GDB:9954586 


BUNDLE BRANCH BLOCK 


HCLl 


GDB:1 19304 


HAIR COLOR-1; HCLl 


HHC2 


GDB:249836 


HYPOCALCIURIC HYPERCALCEML\, 
FAMILL\L, TYPE U; HHC2 


HHC3 


GDB:9955121 


HYPOCALCIURIC HYPERCALCEMIA. 
FAMILIAL, TYPE HI; HHC3 


ICAM3 


GDB:1 36236 


INTERCELLULAR ADHESION 
MOLECULE-3; ICAM3 


INSR 


GDB: 11 9352 


INSULIN RECEPTOR; INSR 


JAK3 


GDB:376460 


JANUS KINASE 3 JAK3 


KLK3 


GDB: 119695 


ANTIGEN, PROSTATE-SPECIFIC; APS 


LDLR 


GDB: 119362 


HYPERCHOLESTEROLEMIA. FAMILIAL; 
FHC 


LHB 


GDB: 119364 


LUTEINIZING HORMONE, BETA 
POLYPEPTIDE; LHB 
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LIGl 


GDB: 127274 


LIGASE I, DNA, ATP-DEPENDENT; LIGl 


LOH19CR1 


GDB:9837482 


ANEMIA, CONGENITAL HYPOPLASTIC, 
OF RT APKFAN AND DIAMOND 


T VT 1 




T FT TkTFMT A T VMPHOfn 1 • T VT 1 


iVLrtJNZD 1 


yjuo. 1 1 7 J / o 




MCOLNl 


GDB:10013974 


MUCOLIPIDOSIS IV 


MDRV 


GDB:6306714 


MUSCULAR DYSTROPHY, AUTOSOMAL 
DOMINANT, WITH RIMMED VACUOLES; 
MDRV 


MLLTl 


GDB: 136791 


MYELOID/LYMPHOID OR MIXED 
LINEAGE LEUKEMIA, TRANSLOCATED 
TO, 1; MLLTl 


NOTCH3 


GDB:361163 


DEMENTIA, HEREDITARY 
MULTI-INFARCT TYPE NOTCH. 
DROSOPHILA, HOMOLOG OF. 3; NOTCH3 


NPHSl 


GDB:342105 


NEPHROSIS 1, CONGENITAL, FINNISH 

1 I X Cry iNrXloi 


OFC3 


GDB: 128060 


OROFACIAL CLEFT-3; OFC3 


OPA3 


GDB:9954590 


OPTIC ATROPHY, INFANTILE, WITH 
CHORFA AND SPASTIC PARAPLEGIA 


PEPD 


GDB- 120273 


PEPTIDASE D* PEPD 


PRPF31 


GDB:3339ll 


RETINITIS PIGMENTOSA 1 1; RPl 1 
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PRTN3 


GDB: 126876 


PROTEINASE 3; PRTN3; PR3 


PRX 


GDB: 1 1501256 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS 


PSGl 


GDB: 120321 


PREGNANCY-SPECIFIC 

BETA-1 -GLYCOPROTEIN 1; PSGl 


PVR 


GDB: 1 20324 


POLIOVIRUS SUSCEPTIBILITY, OR 

SENsrrivrrY; pvs 


RYRl 


GDB: 120359 


CENTRAL CORE DISEASE OF MUSCLE 
HYPERTHERMIA OF ANESTHESIA 
RYANODINE RECEPTOR-1 ; RYRl 


SLC5A5 


GDB:5892184 


SOLUTE CARRIER FAMILY 5, MEMBER 5; 
SLC5A5 


SLC7A9 


GDB:9958852 


CYSTINURIA, TYPE ffl; CSNU3 


STKll 


GDB:9732383 


PEUTZ-JEGHERS SYNDROME 
SERINE/THREONINE PROTEIN KINASE 
11; STKll 


TBXA2R 


GDB:127517 


THROMBOXANE A2 RECEPTOR, 
PLATELET; TBXA2R 


TGFBl 


GDB:120729 


ENGELMANN DISEASE TRANSFORMING 
GROWTH FACTOR, BETA-1; TGFBl 


TNNI3 


GDB: 125309 


TROPONIN I, CARDL\C; TNNI3 


TYROBP 


GDB:9954457 


POLYCYSTIC LIPOMEMBRANOUS 
OSTEODYSPLASIA WITH SCLEROSING 
LEUKOENCEPHALOPATHY 



Table 22: Genes, Locations and Genetic Disorders on Chromosome 20 
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OMIIVr T ink 


ADA 


GDB:119649 


ADENOSINE DEAMINASE; ADA 


AHCY 


GDB:118983 


S-ADENOSYLHOMOCYSTEINE 
HYDROLASE; AHCY 


AVP 


GDB: 119009 


DIABETES INSIPIDUS, 
NEUROHYPOPHYSEAL TYPE ARGININE 
VASOPRESSIN; AVP 


CDAN2 


GDB:9823270 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE H 


CDMPl 


GDB:438940 


CHONDRODYSPLASIA, GREBE TYPE 
CARTILAGE-DERIVED MORPHOGENETIC 
PROTEIN 1 


CHEDl 


GDB:3837719 


CORNEAL DYSTROPHY, CONGENITAL 
ENDOTHEUAL; CHED 


CHRNA4 


GDB:128Io9 


CHOLINERGIC RECEPTOR, NEURONAL 
NICOTINIC, ALPHA POLYPEPTIDE 4; 
CHRNA4 EPILEPSY, BENIGN NEONATAL; 


CST3 


GDB:119817 


AMYLOIDOSIS VI 


EDN3 


GDB: 119862 


ENDOTHELIN-3; EDN3 
WAARDENBURG-SHAH SYNDROME 


EEGVl 


GDB: 127525 


ELECTROENCEPHALOGRAM, 
T OW-VOT TAGF 


FTLLl 


GDB: 119235 


FERRITIN LIGHT CHAIN; FTL 


GNAS 


GDB: 120628 


GUANINE NUCLEOTIDE-BINDING 
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PROTEIN, ALPHA-STIMULATING 
POLYPEPTIDE; 


GSS 


GDB:637022 


GLUTATHIONE SYNTHETASE DEFICIENCY 
OF ERYTHROCYTES, HEMOLYTIC 
ANEML\ PYROGLUTAMICACIDURIA 
HNF4AGDB:393281DL\BETES MELLITUS, 
AUTOSOMAL DOMINANT 
TRANSCRIPTION FACTOR 14. HEPATIC 
NUCLEAR FACTOR; TCF14 


JAGl 


GDB:6 175920 


CHOLESTASIS WITH PERIPHERAL 
PULMONARY STENOSIS JAGGED 1; JAGl 


KCNQ2 


GDB:9787229 


EPILEPSY, BENIGN NEONATAL; EBNl 
POTASSIUM CHANNEL, VOLTAGE-GATED, 
SUBFAMILY Q, MEMBER 2 


MKKS 


GDB:9860197 


HYDROMETROCOLPOS SYNDROME 


NBIAl 


GDB:4252819 


HALLERVORDEN-SPATZ DISEASE 


PCKl 


GDB: 125349 


PHOSPHOENOLPYRUVATE 
CARBOXYKINASE 1, SOLUBLE; PCKl 


PD 


GDB:203940 


PROTEINASE INHIBITOR 3; PD 


PPGB 


GDB: 119507 


NEURAMINIDASE DEFICIENCY WITH 
BETA-GALACTOSIDASE DEFICIENCY 


PPMD 


GDB:702144 


CORNEAL DYSTROPHY, HEREDITARY 
POLYMORPHOUS POSTERIOR; PPCD 


PRNP 


GDB: 120720 


GERSTMANN-STRAUSSLER DISEASE; GSD 


THBD 


GDB:1 19613 


THROMBOMODULIN; THBD 
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TOPI 



GDB: 120444 



TOPOISOMERASE (DNA) I; TOPI 



Table 23: Genes, Locations and Genetic Disorders on Chromosome 21 



Gene 


GDB Accession ID 


OMIM Link 


AIRE 


GDB:567198 


AUTOIMMUNE 

POLYENDOCRINOPATHY-CANDIDL\S 
IS-ECTODERMAL DYSTROPHY; 
APECED 


APP 


GDB: 119692 


ALZHEIMER DISEASE; AD AMYLOID 
BETA A4 PRECURSOR PROTEIN; APP 


CBS 


GDB: 119754 


HOMOCYSTINUIUA 


COL6A1 


GDB: 119065 


COLLAGEN, TYPE VL ALPHA-1 
CHAIN; COL6A1 MYOPATHY, BENIGN 
CONGENITAL, WITH CONTRACTURES 


COL6A2 


GDB:1 19793 


COLLAGEN, TYPE VI, ALPHA-2 
CHAIN; COL6A2 MYOPATHY, BENIGN 
CONGENITAL. WITH CONTRACTURES 


CSTB 


GDB:5215249 


MYOCLONUS EPILEPSY OF 
UNVERRICHT AND LUNDBORG 
CYSTATIN B; CSTB 


DCR 


GDB: 125354 


TRISOMY 21 


DSCRl 


GDB:731000 


TRISOMY 21 


FPDMM 


GDB:9954610 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNIT 2; CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 
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HLCS 


GDB:392648 


MULTIPLE CARBOXYLASE 
DEFICIENCY, BIOTIN-RESPONSIVE; 
MCD 


HPEl 


GDB: 136065 


HOLOPROSENCEPHALY, FAMILL\L 
ALOBAR 


ITGB2 


GDB: 120574 


INTEGRIN BETA-2; ITGB2 


KCNEl 


GDB:127909 


POTASSIUM VOLTAGE-GATED 
CHANNEL, ISK-RELATED 
SUBFAMILY. MEMBER 1; 




GDB -4073 044 


KNOBLOCH SYNDROME; KNO 


PRSS7 


GDB:384083 


ENTEROKINASE DEFICIENCY 


RUNXl 


GDB: 1283 13 


CORE-BINDING FACTOR, RUNT 
DOMAIN ALPHA SUBUNIT 2- CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 


SODl 


GDB: 119596 


AMYOTROPHIC LATERAL SCLEROSIS 
STTPFROXIDE DISMUTASE-1: SODl 
MUSCULAR ATROPHY, 
PROGRESSIVE, WITH AMYOTROPHIC 
LATERAL SCLEROSIS 


TAM 


GDB:9958709 


MYELOPROLIFERATIVE SYNDROME, 
TRANSIENT 


Table 24: Genes, Locations and Genetic Disorders on Chromosome 22 


Gene 


GDB Accession ID 


OMIM Link 


ADSL 


GDB: 119655 


ADENYLOSUCCINATE LYASE; ADSL 
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ARSA 


GDB:1 19007 


METACHROMATIC 

T FT IKTiHYSTROPHY LATE-INFANTTT F 


RPR 


rJDR- 120562 


RRFAK'POTNT PI TJSTFR REGION- RPR 


CECR 


GDB: 119772 


CAT EYE SYNDROME; CES 


CHEK2 


GDB:9958730 


LI-FRAUMENI SYNDROME; LPS 
OSTEOGENIC SARCOMA 


COMT 


GDB: 11 9795 


CATECHOL-O-METHYLTRANSFERASE; 
COMT 


CRYBB2 


GDB:1 19075 


CRYSTALLIN, BETA B2; CRYBB2 
CATARACT, CONGENITAL, CERULEAN 
TYPE, 2; CCA2 


CSF2RB 


GDB:126838 


GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR 

IJPr'T^DTr^P TiT7TA QT TOI T^^T• 


CTHM 


GDB:439247 


HEART MALFORMATIONS; CTHM 


CYP2D6 


GDB:132127 


CYTOCHROME P450, SUBFAMILY IID; 
CYP2D 


CYP2D@ 


GDB:119832 


CYTOCHROME P450, SUBFAMILY IID; 

I r^xj 


DGCR 


GDB: 119843 


DIGEORGE SYNDROME; DGS 


DIAl 


GDB: 11 9848 


METHEMOGLOBINEMIA DUE TO 
DEFICIENCY OF METHEMOGLOBIN 
REDUCTASE 


EWSRl 


GDB: 135984 


EWING SARCOMA; EWS 
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GDB- 120623 


GLUTATHIONURIA 




GDB* 1201 80 


MENINGIOMA; MGM 


JVUN i 




MENINGIOMA- MGM 


XT A A 




ALPHA-GALACTOSIDASE B: GALB 


NF2 


GDB: 120232 


NEUROFIBROMATOSIS. TYPE H; NF2 


OGS2 


GDB:9954619 


HYPERTELORISM WITH ESOPHAGEAL 
ABNORMALITY AND HYPOSPADIAS 


PDGFB 


GDB: 120709 


V-SIS PLATELET-DERIVED GROWTH 
FACTOR BETA POLYPEPTIDE; PDGFB 


PPARA 


GDB:202877 


PEROXISOME PROLIFERATOR 
ACTIVATED RECEPTOR, ALPHA; 

PPA12 A 


l:'KUJJil 




TTYPFRPROLINEMIA TYPE I 






CYTOCHROME c OXIDASE DEFICIENCY 






SCHIZOPHRENIA DISORDER-4; SCZD4 


SERPINDl 


GDB:120038 


HEPARIN COFACTOR II; HCF2 


SLC5A1 


GDB: 120375 


SOLUTE CARRIER FAMILY 5, MEMBER 
1- SLC5A1 


SOXIO 


GDB:9834028 


SRY-BOX 10; SOXIO 


TCN2 


GDB: 119608 


TRANSCOBALAMIN 11 DEFICIENCY 
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TIMP3 


GDB:138175 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-3- TIMP3 


VCF 


GDB: 136422 


VELOCARDIOFACIAL SYNDROME 



Table 25: Genes, Locations and Genetic Disorders on Chromosome X 



Gene 


GDB Accession ID 


OMIM Link 


ABCDl 


GDB: 118991 


ADRENOLEUKODYSTROPHY; ALD 


ACTLl 


GDB:119648 


ACTIN-LIKE SEQUENCE-1; ACTLl 


ADFN 


GDB:118977 


ALBINISM-DEAFNESS SYNDROME; ADFN; 
ALDS 


AGMX2 


GDB:1 19661 


AGAMMAGLOBULINEMIA, X-LINKED, 
TYPE 2; AGMX2; XLA2 


AHDS 


GDB: 125899 


MENTAL RETARDATION, X-LINKED, WITH 
HYPOTONIA 


AIC 


GDB: 118986 


CORPUS CALLOSUM, AGENESIS OF. WITH 
CHORIORETINAL ABNORMALITY 


AIED 


GDB: 119663 


ALBINISM, OCULAR, TYPE 2; OA2 


AIH3 


GDB: 13 1443 


AMELOGENESIS IMPERFECTA-3, 
HYPOPLASTIC TYPE; AIH3 


ALAS2 


GDB: 119666 


ANEMIA, HYPOCHROMIC 


AMCD 


GDB:5584286 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, DISTAL 
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AMELX 


GDB:1 19675 


AMELOGENESIS IMPERFECTA- 1, 
HYPOPLASTIC TYPE; AMI 


ANOPl 


GDB: 128454 


CLINICAL; ANOPl 


AR 


GDB: 120556 


ANDROGEN INSENSITIVITY SYNDROME; 
AIS ANDROGEN RECEPTOR; AR 


ARAFl 


GDB: 119004 


V-RAF MURINE SARCOMA 361 1 VIRAL 
ONCOGENE HOMOLOG I* ARAFl 


ARSC2 


GDB: 119702 


ARYLSULFATASE C, f FORM; ARSC2 


ARSE 


GDB:555743 


CHONDRODYSPLASIA PUNCTATA 1. 
X-LINKED RECESSIVE; CDPXl 


ARTS 


GDB:9954651 


FATAL X-LINKED, WITH DEAFNESS AND 
LOSS OF VISION 


ASAT 


GDB:9954649 


SIDEROBLASTIC, AND SPINOCEREBELLAR 
ATAXIA- ASAT 


ASSP5 


GDB:1 19019 


CITRULLINEML\ 


ATP7A 


GDB: 119395 


ATPase, Cu(2+)-TRANSPORTING, ALPHA 
POLYPEPTIDE; ATP7A MENKES 
SYNDROME 


ATRX 


GDB: 136052 


ALPHA-THALASSEMLVMENTAL 
PETAWriATimsI ^VNDPOMF Y-T INKED" 
ATRX ALPHA-THALASSEMIA/MENTAL 
RETARDATION SYNDROME, 
NONDELETION TYPE 


AVPR2 


GDB:131475 


DIABETES INSIPIDUS, NEPHROGENIC 


BFLS 


GDB: 120566 


BORJESON SYNDROME; BORJ 



-216- 



wo 2004/001010 PCT/US2003/019760 



BGN 


GDB:119727 


BIGLYCAN; BGN 


BTK 


GDB: 120542 


BRUTON AGAMMAGLOBULINEMIA 


BZX 


GDB:5205912 


BAZEX SYNDROME; BZX 


CIHR 


GDB: 11 9040 


TATA BOX BINDING PROTEIN 
(TBP)-ASSOCIATED FACTOR 2A; TAF2A 


CACNAIF 


GDB:6053864 


NIGHTBLINDNESS, CONGENITAL 
STATIONARY, X-LINKED, TYPE 2; CSNB2 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, ALPHA IF 


CALB3 


GDB: 133780 


CALBINDIN 3; CALB3 


CBBM 


GDB:9958963 


COLORBLINDNESS, 

BLUE-MONO-CONE-MONOCHROMATIC 
TYPE; CBBM 


CCT 


GDB: 119756 


CATARACT, CONGENITAL TOTAL, WITH 
POSTERIOR SUTURAL OPACITIES IN 
HETEROZYGOTES; 


CDRl 


GDB: 119053 


CEREBELLAR DEGENERATION-RELATED 






PR AMrOFPOXrTntJA^^ AT QVXrni? OX/TP- PPM Q 








CHM 


GDB: 120400 


CHOROIDEREMIA; CHM 


CHR39C 


GDB: 119779 


CHOLESTEROL REPRESSIBLE PROTEIN 
39C; CHR39C 
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CIDX 


GDB: 127736 


SEVERE COMBINED IMMUNODEFICffiNCY 
DISEASE X-LINKED. 2: SCIDX2 


CLA2 


GDB: 119782 


CEREBELLAR ATAXIA, X-LINKED; CLA2 


CLCN5 


GDB:270667 


CHLORIDE CHANNEL 5; CLCN5 FANCONI 

NEPHROCALCINOSIS AND RENAL STONES 
NEPHROLITHIASIS, X-LINKED RECESSIVE, 
WITH RENAL FAILURE; XRN 


CLS 


GDB: 11 9784 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 COFFIN-LOWRY 
SYNDROME; CLS 


CMTX2 


GDB: 1283 11 


CHARCOT-MARIE-TOOTH NEUROPATHY, 
X-LINKED RECESSIVE, 2; CMTX2 


CMTX3 


GDB:128151 


CHARCOT-MARIE-TOOTH NEUROPATHY, 
X-LINKED RECESSIVE 3- CMTX3 






DFRMOTDS OF CORNEA- CND 




nr»R'i 107R7 


rONF DYSTROPHY X-LINKED T CODl 


COD2 


GDB:6520166 


CONE DYSTROPHY, X-LINKED, 2; COD2 


COL4A5 


GDB: 120596 


COLLAGEN, TYPE IV, ALPHA-5 CHAIN; 
COL4A5 LEIOMYOMATOSIS, ESOPHAGEAL 
AND VULVAL, WITH NEPHROPATHY 


COL4A6 


GDB:222775 


COLLAGEN, TYPE IV, ALPHA-6 CHAIN; 
COL4A6 LEIOMYOMATOSIS, ESOPHAGEAL 
AND VULVAL, WITH NEPHROPATHY 


CPX 


GDB: 1 20598 


CLEFT PALATE, X-LINKED; CPX 
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CVDl 


GDB:9954659 


CARDIAC VALVULAR DYSPLASL\, 
X-LINKED 


CYBB 


GDB:120513 


GRANULOMATOUS DISEASE, CHRONIC; 


DCX 


GDB:9823272 


LISSENCEPHALY, X-LINKED 


DFN2 


GDB: 119091 


DEAFNESS, X-LINKED 2, PERCEPTIVE 
CONGENITAL; DFN2 


DFN4 


GDB:433255 


DEAFNESS, X-LINKED 4. CONGENITAL 
SENSORINEURAL; DFN4 


DFN6 


GDB: 1320698 


DEAFNESS, X-LINKED. 6, PROGRESSIVE; 


DHOF 


GDB:1 19847 


FOCAL DERMAL HYPOPLASIA; DHOF 


DIAPH2 


GDB:9835484 


DIAPHANOUS. DROSOPHILA, HOMOLOG 
OF, 2 DKCIGDB:! 19096 DYSKERATOSIS 
CONGENITA; DKC 


DMD 


GDB: 119850 


MUSCULAR DYSTROPHY, 
PSEUDOHYPERTROPHIC PROGRESSP/E, 


DSS 


GDB:433750 


DOSAGE-SENSITIVE SEX REVERSAL; DSS 


DYT3 


GDB: 11 8789 


TORSION DYSTONIA-3, X-LINKED TYPE; 
DYT3 


EBM 


GDB: 1 19102 


BULLOUS DYSTROPHY, HEREDITARY 
MACULAR TYPE 


EBP 


GDB: 12521 2 


CHONDRODYSPLASIA PUNCTATA, 
X-LINKED DOMINANT; CDPX2; CDPXD; 
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CPXD 


EDI 


GDB: 119859 


ECTODERMAL DYSPLASIA, ANHIDROTIC; 
EDA 


ELKl 


GDB: 119867 


ELKl, MEMBER OF ETS ONCOGENE 
FAMILY; ELKl 


EMD 


GDB:119108 


MUSCULAR DYSTROPHY, TARDIVE, 
DREIFUSS-EMERY TYPE, WITH 
CONTRACTURES 


EVR2 


GDB: 136068 


EXUDATIVE VITREORETINOPATHY, 
FAX/TTT TAT V_T TNTK'Fr) RFCFJSSIVE* EVR2 


■CO/** 






ry 




HFIV/TDPHTT TA Tl- HFMB 


r<_r 1 




F.PFT T PRODTIPTTON X-LTNKED* FCPX 


FDPSL5 


GDB: 11 9922 


SYNTHETASE-5; FPSL5 


FGDl 


GDB:1 19131 


SYNDROME FACIOGENTTAL DYSPLASIA; 


FGSl 


GDB:9836950 


FG SYNDROME 


FMRl 


GDB: 1 29038 


FRAGILE SITE MENTAL RETARDATION- 1; 
FMRl 


FMR2 


GDB:141566 


FRAGILE SITE, FOLIC ACID TYPE, RARE, 
FRA(X)(q28); FRAXE 


G6PD 


GDB: 120621 


GLUCOSE-6-PHOSPHATE 
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DEHYDROGENASE; G6PD 


GABRA3 


GDB: 119968 


GAMMA- AMINOBUTYRIC ACID RECEPTOR, 
ALPHA -3- fiABRA3 


GATAl 


GDB: 125373 


GATA-BINDING PROTEIN 1; GATAl 


GDIl 


GDB: 1347097 


GDP DISSOCIATION INHIBITOR 1; GDIl 
MENTAL RETARDATION, X-LINKED 


GDXY 


GDB:9954629 


DYSGENESIS, XY FEMALE TYPE; GDXY 




CjDB: 125246 


CHARCOT-MARIE-TOOTH PERONEAL 
MUSCULAR ATROPHY, X-LINKED; CMTXl 
GAP JUNCTION PROTEIN, BETA-1, 32 KD; 

VJJX3 1 




OFIPl-l 1 0971 


MVPFUriT VCFROT PA/fTA 
n. 1 X rLrxvJJ-* i rZf xx v./ LCrxVi j-rv 


GLA 


GDB: 119272 


ANGIOKERATOMA, DIFFUSE 


GPC3 


GDB:3770726 


GLYPICAN-3; GPC3 SIMPSON DYSMORPHIA 
SYNDROME; SDYS 


GRPR 


GDB: 128035 


GASTRIN-RELEASING PEPTIDE RECEPTOR; 


GTD 


GDB:9954635 


GONADOTROPIN DEFICIENCY; GTD 


GUST 


GDB:9954655 


MENTAL RETARDATION WITH OPTIC 
ATROPHY, DEAFNESS, AND SEIZURES 


HMSl 


GDB:251827 


1;HMS1 
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HPRTl 


GDB:1 19317 


HYPOXANTHINE GUANINE 
PHOSPHORIBOSYLTRANSFERASE 1; HPRTl 


HPT 


GDB:119322 


HYPOPARATHYROIDISM, X-LINKED; HYPX 


HTC2 


GDB:700980 


HYPERTRICHOSIS, CONGENITAL 
GENERALIZED; CGH; HCG 


HTR2C 


GDB:378202 


5-@HYDROXYTRYPTAMINE RECEPTOR 2C; 
HTR2C 


HVR 

XI I Xv 


GDB-9954625 


REGULATOR; HYR 




GDB- 120521 


MUCOPOLYSACCHARIDOSIS TYPE H 


IHGl 


GDB: 119343 


HYPOPLASIA OF, WITH GLAUCOMA; IHG 


iL2KLr 




TMTFRI ETJKJN-2 RECEPTOR. GAMMA: 
IL2RG SEVERE COMBINED 
IMMUNODEFICIENCY DISEASE, X-LINKED, 
2; SCIDX2 


INDX 


GDB:9954657 


IMMUNONEUROLOGIC DISORDER, 
X-LINKED 


IPl 


GDB: 120105 


INCONTINENTLY PIGMENTI. TYPE I; IPl 


IP2 


GDB:120106 


INCONTINENTLY PIGMENTI, TYPE II; IP2 


JMS 


GDB:204055 


MENTAL RETARDATION, X-LINKED, WITH 
GROWTH RETARDATION, DEAFNESS, AND 


KALI 


GDB:120116 


KALLMANN SYNDROME 1; KALI 


KFSD 


GDB:128174 


KERATOSIS FOLLICULARIS SPINULOSA 
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DECALVANS CUM OPHIASI; KFSD 


LICAM 


GDB: 120133 


CLASPED THUMB AND MENTAL 
RETARDATION LI CELL ADHESION 
MOLECULE; LICAM 


LAMP2 


GDB: 125376 


LYSOSOME-ASSOCL\TED MEMBRANE 
PROTEIN B; LAMP2; LAMPB 


MAA 


GDB: 11 9372 


MICROPHTHALML\ OR ANOPHTHALMOS, 
WITH A^^nrTATFr) AMOMAT TF^- MAA 












MOIMOAMIMF OYTDASF A- MAOA 


MAOB 


GDB: 119377 


MONOAMINE OXIDASE B; MAOB 


MCF2 


GDB:120168 


MCF.2 CELL LINE DERIVED 
TRANSFORMING SEQUENCE; MCF2 


MCS 


GDB: 128370 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-4, WITH CONGENITAL 
CONTRACTURES 


ME AX 


GDB: 11 9383 


X-LINKED, WITH EXCESSIVE AUTOPHAGY; 
XMEA- MEAX 


MErP2 


GDB-3851454 


SYNDROME* RTT 


MF4 


GDB* 119386 


METACARPAL 4-5 FUSION- MF4 


MGCl 


GDB: 1201 79 


MEGALOCORNEA; MGCl; MGCN 


MIC5 


GDB: 120526 


SURFACE ANTIGEN, X-LINKED; SAX 
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MIDI 


GDB:9772232 


OPrrZ SYNDROME 


MLLT7 


GDB:392309 


MYELOID/LYMPHOID OR MIXED-LINEAGE 
LEUKEMIA, TRANSLOCATED TO, 7; MLLT7 


MLS 


GDB:262123 


MICROPHTHALMIA WITH LINEAR SKIN 
DEFECTS; MLS 


MRSD 


GDB: 119398 


MENTAL RETARDATION, SKELETAL 
DYSPLASIA, AND ABDUCENS PALSY; 


MRX14 


GDB:138453 


RETARDATION, X-LINKED 14; MRX14 


MRXl 


GDB: 120193 


MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 1; MRXl 


MRX20 


GDB:2 17050 


MENTAL RETARDATION, X-LINKED 20; 
MRX20 


MRX2 


GDB:120194 


RETARDATION, X-LINKED NONSPECIFIC, 
TYPE2;MRX2 


MRX3 


GDB:128105 


GDP DISSOCIATION INHIBITOR 1; GDIl 

MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 3; MRX3 


MRX40 


GDB:700754 


MENTAL RETARDATION, X-LINKED, WITH 
HYPOTONIA 


MRXA 


GDB:9954641 


MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, WITH APHASIA; MRXA 


MSD 


GDB: 11 9399 


SYNDROME 


MTMl 


GDB:1 19439 


MYOTUBULAR MYOPATHY 1; MTMl 
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MYCL2 


GDB: 120209 


MYCL-RELATED PROCESSED GENE; 


MVPl 

lYl I JT 1 


OnR- 19778'^ 

vJiyo . i ^ / I 


MYOPIA Y-T INKED- MYPl 






XrOPPTP T^TQFA^F- TsTHP 
iNWiVlvJLtl JL/loErVoJC, INJL/l^ 


NHS 


GDB: 120235 


CATARACT-DENTAL SYNDROME 


NPHLl 


GDB:433705 


NEPHROLITHIASIS, X-LINKED RECESSIVE, 
WITH RENAL FAILURE; XRN 


NROBl 


GDB: 118982 


ADRENAL HYPOPLASIA, CONGENITAL; 


JNoA 






NYSl 


GDB: 119458 


NYSTAGMUS, X-LINKED; NYS 


NYX 


GDB: 119814 


NIGHTBLINDNESS, CONGENITAL 

O 1 A 1 lUJN AK X , W 1 1 rl JVl I yJriA, v^oiNi5 1 


OAl 


GDB: 119459 


ALBINISM, OCULAR, TYPE 1; OAl 


OASD 


GDB: 138457 


OCULAR, WITH LATE-ONSET 
SENSORINEURAL DEAFNESS; OASD 


OCRL 


GDB: 119461 


LOWE OCULOCEREBRORENAL 


ODTl 


GDB- 125360 


TEETH ABSENCE OF 


OFDl 


GDB: 120248 


OROFACIODIGITAL SYNDROME 1; OFDl 
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nPA2 


GDB: 125358 


OPTIC ATROPHY 2; 0PA2 


OPDl 


GDB: 120249 


OTOPALATODIGITAL SYNDROME 


OPEM 


GDB: 119467 


OPHTHALMOPLEGIA, EXTERNAL, AND 
MYOPL\; OPEM 


OPNILW 


GDB: 120724 


COLORBLINDNESS, PARTIAL, PROTAN 
SERIES; CBP 


OPNIMW 


GDB: 120622 


COLORBLINDNESS, PARTIAL, DEUTAN 
SERIES; CBD; DCB 


OTC 


GDB: 119468 


ORNITHINE TRANSCARBAMYLASE 
DEFICIENCY, HYPERAMMONEMIA DUE TO; 
OTC 


P3 


GDB:9954667 


PROTEIN P3 


PDHAl 


GDB: 118895 


PYRUVATE DEHYDROGENASE COMPLEX, 
El -ALPHA POLYPEPTIDE- 1: PDHAl 






AMYLOIDOSIS, FAMILIAL CUTANEOUS 


i It 


GDB' 120275 


PROPERDIN DEFICIENCY, X-LINKED 


PFKFBl 


GDB- 125375 


6-@PHOSPHOFRUCTO-2-KINASE; PFKFBl 


PGKl 


GDB 120282 


PHOSPHOGLYCERATE KINASE 1; PGKl 


PGKIPI 


GDB:120283 


PHOSPHOGLYCERATE KINASE 1; PGKl 


PGS 


GDB: 128372 


DANDY-WALKER MALFORMATION WITH 
MENTAL RETARDATION, BASAL GANGLL\ 
DISEASE, 
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PHEX 


GDB: 120520 


HYPOPHOSPHATEMIA, VITAMIN 
D-RESISTANT RICKETS; HYP 


PHKAl 


GDB: 120285 


PHOSPHORYLASE KINASE, ALPHA 1 
SlIRirMTT rMTJSCLEV PHKAl 


PHI? A*? 




ni vpnnFTM storage dtsfasf vttt 


PHP 


GDB: 119494 


PANHYPOPITUITARISM; PHP 


PIGA 


GDB: 138 138 


PHOSPHATIDYLINOSITOL GLYCAN, CLASS 


PT PI 




PPOTFOT TPm PROTFTTsJ A/TVFT fW* PT P 






PRFM ATT TPF OVARTAM FATT TTRF 1 • POFl 


POLA 


GDB: 120304 


POLYMERASE, DNA, ALPHA; POLA 


POU3F4 


GDB:351386 


DEAFNESS, CONDUCTIVE. WITH STAPES 
FIXATION 


PPMX 


GDB:9954669 


RETARDATION WITH PSYCHOSIS, 
PYRAMIDAL SIGNS, AND 


PRD 


GDB:371323 


DYSPLASIA, PRIMARY 


PRPSl 


GDB: 1203 18 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-I; PRPSl 


PRPS2 


GDB: 120320 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-I!- PRPS2 


PRS 


GDB: 128368 


MENTAL RETARDATION, X-LINKED, 
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SYNDROMIC-2, WITH DYSMORPHISM AND 
CEREBRAL 


PRTS 


GDB:128367 


PARTINGTON X-LINKED MENTAL 
RETARDATION SYNDROME: PRTS 






TRAM^PORTFR 2 ABC' TAP2 


RENBP 


GDB: 133792 


RENIN-BINDING PROTEIN; RENBP 


RENSl 


GDB:9806348 


MENTAL RETARDATION, X-LINKED, 


i\jrZ 




PFTnsJTTT^ PTOMFMTOSA-2- RP2 








RPGR 


GDB: 11 8736 


RETINITIS PIGMENTOSA-3; RP3 


RPS4X 


GDB: 128 115 


RIBOSOMAL PROTEIN S4. X-LINKED; 
RPS4X 


RPS6KA3 


GDB:365648 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POT VPFPTTHF RPS6KA3 






RFTTNOSCHISIS" RS 


Sll 


GDB:120361 


ANTIGEN, X-LINKED, SECOND; SAX2 


SDYS 


GDB: 119590 


GLYPICAN-3; GPC3 SIMPSON DYSMORPHIA 
SYNDROME; SDYS 


SEDL 


GDB:120372 


SPONDYLOEPIPHYSEAL DYSPLASIA, 
LATE; SEDL 
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SERPINA7 


GDB: 120399 


tf M-J* u .* ij^i .^^tt ,i Jii ..1' lUi* lUi 

THYROXINE-BINDING GLOBULIN OF 
SERUM; TBG 


SH2D1A 


GDB: 120701 


IMMUNODEFICIENCY, X-LINKED 
PROGRESSIVE COMBINED VARIABLE 


SHFM2 


GDB:226635 


SPLIT-HAND/SPLIT-FOOT ANOMALY, 


SHOX 


GDB:6 118451 


SHORT STATURE; SS 


SLC25A5 


GDB: 125 190 


ADENINE NUCLEOTIDE TRANSLOCATOR 2; 
ANT2 


SMAX2 


GDB:9954643 


SPINAL MUSCULAR ATROPHY, X-LINKED 

T cm AT TTVITTAMTTT TJ 


SRPX 


GDB:3811398 


RETINITIS PIGMENTOSA-3; RP3 


SRS 


GDB:136337 


MENTAL RETARDATION, X-LINKED, 

CXrVT^Tm T> /^ti TXTO /^XT TnX/TOT? 

oJN YJJiiK-KUdlNoUN lY rt» 


CTC 




iCxl i rl Y(Joio, A-LlNlsJiiJ 


^ I IN I 




O I IN AroUN 1, o I JN 1 


SYP 


GDB: 125295 


SYNAPTOPHYSIN; SYP 


TAFl 


GDB: 120573 


TATA BOX BINDING PROTEIN 
(TBP)-ASSOCIATED FACTOR 2A; TAF2A 


TAZ 


GDB:120609 


CARDIOMYOPATHY, DILATED 3A; CMD3A 
ENDOCARDTAI FIRROFI ASTOSIS-2- FFF7 


TBX22 


GDB: 10796448 


CLEFT PALATE, X-LINKED; CPX 
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TDD 


GDB: 119610 


MALE PSEUDOHERMAPHRODITISM: 
DEFICIENCY OF TESTICULAR 
17,20-DESMOLASE; 


TFE3 


GDB: 125870 


TRANSCRIPTION FACTOR FOR 
IMMUNOGLOBULIN HEAVY-CHAIN 
ENHANCER-3; TFE3 


THAS 


GDB: 128 158 


THORACOABDOMINAL SYNDROME; TAS 


THC 


GDB: 125361 


THROMBOCYTOPENL\, X-LINKED; THC; 
XLT 


TIMM8A 


GDB:1 19090 


DEAFNESS 1, PROGRESSIVE; DFNl 


TIMPl 


GDB:1 19615 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-1; TIMPl 


TKCR 


GDB:1 19616 


TORTICOLLIS, KELOIDS, 
CRYPTORCHIDISM. AND RENAL 
DYSPLASIA; TKC 


TNFSF5 


GDB: 120632 


IMMUNODEFICIENCY WITH INCREASED 
IgM 


UBEl 


GDB: 118954 


UBIQUITIN-ACTIVATING ENZYME 1; UBEl 


UBE2A 


GDB: 13 1647 


UBIQUITDSf-CONJUGATING ENZYME E2A; 
UBE2A 


WAS 


GDB: 120736 


WISKOTT-ALDRICH SYNDROME; WAS 


WSN 


GDB:125864 


PARKINSONISM, EARLY-ONSET. WITH 
MENTAL RETARDATION 


WTS 


GDB: 128373 


MENTAL RETARDATION, X-LINKED. 
SYNDROMIC-6, WITH GYNECOMASTIA 
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AND OBESITY- 


wws 


GDB: 120497 


WIEACKER SYNDROME 


XIC 


GDB: 120498 


X-INACTIVATION-SPECIFIC TRANSCRIPT; 

XIST 


XIST 


GDB: 126428 


X-INACTIVATION-SPECIFIC TRANSCRIPT; 






W T OPT 


XM 


GDB: 1 19634 


XM SYSTEM 


XS 


GDB: 119636 


LUTHERAN SUPPRESSOR, X-LINKED; XS; 






ZTMC FTNGFR PROTEIN X-LTNKED' ZFX 


ZIC3 


GDB:249141 


HETEROTAXY, X-LINKED VISCERAL; HTXl 


ZNF261 


GDB:9785766 


MENTAL RETARDATION, X-LINKED; 
DXS6673F 


ZNF41 


GDB: 125865 


ZnsfC FINGER PROTEIN-41; ZNF41 


ZNF6 


GDB: 120508 


ZINC FINGER PROTEIN-6; ZNF6 



Table 26: Genes, Locations and Genetic Disorders on Chromosome Y 



Gene 


GDB Accession ID 


OMIM Link 


AMELY 


GDB: 119676 


AMELOGENIN, Y-CHROMOSOMAL; AMELY 
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ASSP6 


GDB: 119020 


CITRULLINEML\ 


AZFl 


GDB: 119027 


AZOOSPERML\ FACTOR 1; AZFl 


AZF2 


GDB:456131 


AZOOSPERMM. FACTOR 2; AZF2 


DAZ 


GDB:635890 


DELETED IN AZOOSPERML\; DAZ 


GCY 


GDB: 119267 


CONTROL, Y-CHROMOSOME INFLUENCED; 
GCY 


RPS4Y 


GDB: 128052 


RTOOSOMAL PROTEIN S4, Y-LINKED; RPS4Y 


SMCY 


GDB:5875390 


HISTOCOMPATIBILITY Y ANTIGEN; HY; HYA 


SRY 


GDB: 125556 


SEX-DETERMDSriNG REGION Y; SRY 


ZFY 


GDB: 120503 


ZINC FINGER PROTEIN. Y-LINKED; ZFY 


Table 27: 


Genes, Locations and Genetic Disorders in Unknown or Multiple Locations 


Gene 


GDB Accession ID 


OMIM Link 


ABAT 


GDB:581658 


GAMMA-AMINOBUTYRATE TRANSAMINASE 


AEZ 


GDB: 128360 


ACRODERMATITIS ENTEROPATHICA, 
ZINC-DEFICIENCY TYPE; AEZ 


AFA 


GDB:265277 


FILIFORME ADNATUM AND CLEFT PALATE 


AFDl 


GDB:265292 


DYSOSTOSIS, TREACHER COLLINS TYPE, 
WITHLIMB ANOMALIES 
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AGSl 


GDB:10795417 


ENCEPHALOPATHY, FAMILIAL INFANTILE, 
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COLONY STIMULATING FACTOR 2 
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GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR RECEPTOR, 
ALPHA SUBUNIT, 
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CARPAL TUNNEL SYNDROME; CTS; CTSl 
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ISOLATED NONKETOTIC, TYPE IV; NKH4 
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HHG 
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ISOLEUCINE; MTTI 


MTTK 


GDB:118936 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL. LYSINE; MTTK 


MTTLl 


GDB:118937 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL, LEUCINE, 1; MTTLl 


MTTL2 


GDB:118938 


TRANSFER RNA, MITOCHONDRIAL, 
LEUCINE. 2; MTTL2 


MTTN 


GDB: 118940 


TRANSFER RNA, MITOCHONDRIAL, 
ASPARAGINE; MTTN 


MTTP 


GDB: 118941 


TRANSFER RNA, MITOCHONDRIAL, 
PROLINE; MTTP 


MTTSl 


GDB: 118944 
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SYNDROME 
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PHOSPHOENOLPYRUVATE 
CARBOXYKINASE 2, MITOCHONDRIAL; PCK2 
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GDB:433949 


POLYCYSTIC LIVER DISEASE; PLD 
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GLYCOGEN STORAGE DISEASE VII 
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TOY 




SULFOCYSTEINURIA 


TC21 


GDB:5573831 


ONCOGENE TC21 



-239- 



wo 2004/001010 PCT/US2003/019760 



1 xXlVl 
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TRICHOTHIODYSTROPHY; TTD 
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E quivalents: 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described will become apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
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1 . A method of identifying a compound that modulates premature translation 
tennination or nonsense-mediated mRNA decay, said method comprising: 

(a) contacting a member of a library of compounds with a cell containing a first 
5 nucleic acid sequence and a second nucleic acid sequence, wherein the first 

nucleic acid sequence comprises a regulatory element operably linked to a 
reporter gene and the second nucleic acid sequence comprises a nucleotide 
sequence with a premature stop codon that encodes a regulatory protein that 
binds to the regulatory element of the first nucleic acid sequence and 
10 regulates the expression of the reporter gene; and 

(b) detecting the expression of the reporter gene, wherein a compound that 
modulates premature translation termination or nonsense-mediated mRNA 
decay is identified if the expression of the reporter gene in the presence of 
the compound is altered relative to the expression of the reporter gene in the 

1 5 absence of the compound or the presence of a negative control. 

2. A method of identifying a compoxmd that modulates premature translation 
termination or nonsense-mediated mRNA decay, said method comprising: 

(a) contacting a member of a library of compounds with a cell containing a first 
nucleic acid sequence, a second nucleic acid sequence and a third nucleic 

20 acid sequence, wherein (i) the first nucleic acid sequence comprises a 

nucleotide sequence encoding a first fiision protein comprising a DNA 
binding domain and a first protein, the nucleotide sequence of the first 
protein containing a premature stop codon, (ii) the second nucleic acid 
sequence comprises a nucleotide sequence encoding a second fiision protein 

25 comprising an activation domain and a second protein, the second protein 

interacting with the first protein to produce a regulatory protein, and (iii) the 
third nucleic acid sequence comprises a regulatory element operably linked 
to a reporter gene, the expression of the reporter gene being regulated by the 
binding of the regulatory protein to the regulatory element; and 

30 (b) detecting the expression of the reporter gene, wherein a compound that 

modulates premature translation termination or nonsense-mediated mRNA 

decay is identified if the expression of the reporter gene in the presence of 
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the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. 



3. A method of identifying a compound that modulates premature translation 
termination or nonsense-mediated mRNA decay, said method comprising: 

(a) contacting a member of a library of compounds with a cell containing a first 
nucleic acid sequence, a second nucleic acid sequence and a third nucleic 
acid sequence, wherein (i) the first nucleic acid sequence comprises a 
nucleotide sequence encoding a first fusion protein comprising a DNA 
binding domain and a first protein, (ii) the second nucleic acid sequence 
comprises a nucleotide sequence encoding a second fusion protein 
comprising an activation domain and a second protein, the nucleotide 
sequence of the second protein containing a premature stop codon and the 
second protein interacting with the first protein to produce a regulatory 
protein, and (iii) the third nucleic acid sequence comprises a regulatory 
element operably linked to a reporter gene, the expression of the reporter 
gene being regulated by the binding of the regulatory protein to the 
regulatory element; and 

(b) detecting the expression of the reporter gene, wherein a compound that 
modulates premature translation termination or nonsense-mediated mRNA 
decay is identified if the expression of the reporter gene in the presence of 
the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. 

4. A method for identifying a compound that modulates premature translation 
termination or nonsense-mediated mRNA decay, said method comprising: 

(a) contacting a member of a library of compounds with a cell-free translation 
mixture and a nucleic acid sequence comprising a regulatory element 
operably linked to a reporter gene, wherein the reporter gene contains a 
premature stop codon and the cell-fi-ee translation mixture is isolated from 
cells that have been incubated at about OX to about lO^'C; and 

(b) detecting the expression of the reporter gene, wherein a compound that 

modulates premature translation termination or nonsense-mediated mRNA 

decay is identified if the expression of the reporter gene in the presence of 
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the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. 

5. A method for identifying a compound that modulates premature translation 
termination or nonsense-mediated mRNA decay, said method comprising: 

(a) contacting a member of a library of compounds with a cell-free translation 
mixture and a nucleic acid sequence comprising a regulatory element 
operably linked to a reporter gene, wherein the reporter gene contains a 
premature stop codon and the cell-free translation mixture is a S 10 to S30 
cell-free extract; and 

(b) detecting the expression of the reporter gene, wherein a compound that 
modulates premature translation termination or nonsense-mediated mRNA 
decay is identified if the expression of the reporter gene in the presence of 
the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. 

6. The method of claim 4, wherein the cell-free translation mixture is a S 10 to 
S30 cell-free extract. 

7. The method of claim S, wherein the cell-free translation mixture is a S 12 
cell-free extract. 

8. The method of claim 6, wherein the cell-free translation mixture is a S 12 
cell-free extract. 

9. A method of identifying a compound to be tested for its ability to prevent or 
treat a disease characterized by or associated with the presence of a premature stop codon in 
a gene, said method comprising: 

(a) contacting a member of a library of compounds with a cell containing a 
nucleic acid sequence comprising a reporter gene with a premature stop 
codon; and 

(b) detecting the expression of the reporter gene, 

so that if the expression of the reporter gene in the presence of the compound is 
altered relative to the expression of the reporter gene in the absence of the compound or the 
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presence of a negative control, then a compound to be tested for its ability to prevent or treat 
the disease is identified, wherein the disease is famihal hypercholesterolemia, osteogenesis 
imperfecta, cirrhosis, ataxia telangiectasia or a lysosomal storage disease. 

10. A method of identifying a compound to be tested for its ability to prevent or 
treat a disease characterized by or associated with the presence of a premature stop codon in 
a gene, said method comprising: 

(a) contacting a member of a library of compounds with a cell-free translation 
mixture and a nucleic acid sequence comprising a reporter gene with a 
premature stop codon; and 

(b) detecting the expression of the reporter gene, 

so that if the expression of the reporter gene in the presence of the compound is 
altered relative to the expression of the reporter gene in the absence of the compound or the 
presence of a negative control, then a compound to be tested for its ability to prevent or treat 
the disease is identified, wherein the disease is familial hypercholesterolemia, osteogenesis 
imperfecta, cirrhosis, ataxia telangiectasia or a lysosomal storage disease. 

1 1 . The method of claim 1, 2, 3, 4 or S, wherein the method further comprises 
determining the structure of the compound that suppresses premature translation termination 
or nonsense-mediated mRNA decay. 

12. The method of claim 9 or 10, wherein the method further comprises 
determining the structure of the compound. 

13. The method of claim 1, 2, 3, 4, 5, 9 or 10, wherein the reporter gene is 
firefly luciferase, renilla luciferase, click beetle luciferase, green fluorescent protein, yellow 
fluorescent protein, red fluorescent protein, cyan fluorescent protein, blue fluorescent 
protein, beta galactosidase, beta glucoronidase, beta lactamase, chloramphenicol 
acetyltransferase, or alkaline phosphatase. 

14. The method of claim 1, 2, 3 or 9, wherein the cell is selected from the group 
consisting of 293T, HeLa, MCF7, Wi-38, SkBr3, Jurkat, CEM, THPl, 3T3, and Raw264.7 
cells. 
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15. The method of claim 4, 5 or 10, wherein the cell-free translation mixture is a 
cell-free extract from 293T, HeLa, MCF7, Wi-38, SkBr3, Jurkat, CEM, THPl, 3T3, or 
Raw264.7 cells. 

16. The method of claim 1, 2, 3, 4, 5, 9 or 10, wherein the compound is selected 
5 from a combinatorial Hbrary of compounds comprising peptoids; random biooligomers; 

diversomers such as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; 
nonpeptidal peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid 
libraries; antibody libraries; carbohydrate libraries; and small organic molecule libraries. 

17. The method of claim 16, wherein the small organic molecule libraries are 
10 libraries of benzodiazepines, isoprenoids, thiazolidinones, metathiazanones, pyrrolidines, 

morpholino compounds, or diazepindiones. 

1 8. The method of claim 1, 2, 3, 4, 5, 9 or 10, wherein the premature stop codon 
isUAG orUGA. 

19. The method of claim 1 , 2, 3, 4, 5, 9 or 10, wherein the premature stop codon 
1 5 context is UAGA, UAGC, UAGG, UAGU, UG AA, UGAC, UGAG or UGAU, 
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